TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Visual Question Answering (VQA)	CLEVR	NS-VQA (1K programs)	Accuracy	99.8	# 1
Visual Question Answering (VQA)	CLEVR-Humans	NS-VQA (1K programs)	Accuracy	67.8	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/neural-symbolic-vqa-disentangling-reasoning/visual-question-answering-on-clevr)](https://paperswithcode.com/sota/visual-question-answering-on-clevr?p=neural-symbolic-vqa-disentangling-reasoning)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/neural-symbolic-vqa-disentangling-reasoning/visual-question-answering-on-clevr-humans)](https://paperswithcode.com/sota/visual-question-answering-on-clevr-humans?p=neural-symbolic-vqa-disentangling-reasoning)`

Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding

NeurIPS 2018 · Kexin Yi, Jiajun Wu, Chuang Gan, Antonio Torralba, Pushmeet Kohli, Joshua B. Tenenbaum ·

We marry two powerful ideas: deep representation learning for visual recognition and language understanding, and symbolic program execution for reasoning. Our neural-symbolic visual question answering (NS-VQA) system first recovers a structural scene representation from the image and a program trace from the question. It then executes the program on the scene representation to obtain an answer. Incorporating symbolic structure as prior knowledge offers three unique advantages. First, executing programs on a symbolic space is more robust to long program traces; our model can solve complex reasoning tasks better, achieving an accuracy of 99.8% on the CLEVR dataset. Second, the model is more data- and memory-efficient: it performs well after learning on a small number of training data; it can also encode an image into a compact representation, requiring less storage than existing methods for offline question answering. Third, symbolic program execution offers full transparency to the reasoning process; we are thus able to interpret and diagnose each execution step.

PDF Abstract NeurIPS 2018 PDF NeurIPS 2018 Abstract

Code

Add Remove Mark official

kexinyi/ns-vqa

254

nerdimite/neuro-symbolic-ai-soc

↳ Quickstart in

Colab

Tasks

Add Remove

Question Answering

Representation Learning

Visual Question Answering

Visual Question Answering (VQA)

Datasets

Visual Question Answering

CLEVR CLEVR-Humans

Results from the Paper

Edit

Ranked #1 on Visual Question Answering (VQA) on CLEVR

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Result	Benchmark
Visual Question Answering (VQA)	CLEVR	NS-VQA (1K programs)	Accuracy	99.8	# 1		Compare
Visual Question Answering (VQA)	CLEVR-Humans	NS-VQA (1K programs)	Accuracy	67.8	# 4		Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove