Dynamic Memory Networks for Visual and Textual Question Answering

4 Mar 2016 · Caiming Xiong, Stephen Merity, Richard Socher

Neural network architectures with memory and attention mechanisms exhibit certain reasoning capabilities required for question answering. One such architecture, the dynamic memory network (DMN), obtained high accuracy on a variety of language tasks. However, it was not shown whether the architecture achieves strong results for question answering when supporting facts are not marked during training or whether it could be applied to other modalities such as images. Based on an analysis of the DMN, we propose several improvements to its memory and input modules. Together with these changes we introduce a novel input module for images in order to be able to answer visual questions. Our new DMN+ model improves the state of the art on both the Visual Question Answering dataset and the bAbI-10k text question-answering dataset without supporting fact supervision.
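
The abstract only mentions improvements to the memory module without detailing them; the published DMN+ description replaces the standard GRU update gate in the episodic memory with a scalar attention gate so that memory updates are driven by each fact's relevance to the question. The sketch below is a minimal, illustrative PyTorch version of such an attention-based GRU cell, not the authors' implementation; the class name `AttnGRUCell`, the dimensions, and the random stand-in attention gates are assumptions chosen for the example.

```python
import torch
import torch.nn as nn


class AttnGRUCell(nn.Module):
    """Attention-based GRU cell: the usual update gate is replaced by an
    externally supplied scalar attention gate g, so how much each fact
    modifies the episode state depends on its relevance to the question."""

    def __init__(self, input_size, hidden_size):
        super().__init__()
        self.Wr = nn.Linear(input_size, hidden_size)
        self.Ur = nn.Linear(hidden_size, hidden_size)
        self.W = nn.Linear(input_size, hidden_size)
        self.U = nn.Linear(hidden_size, hidden_size)

    def forward(self, fact, h_prev, g):
        # fact:   (batch, input_size)  one encoded fact or image region
        # h_prev: (batch, hidden_size) previous episode state
        # g:      (batch, 1)           scalar attention gate in [0, 1]
        r = torch.sigmoid(self.Wr(fact) + self.Ur(h_prev))
        h_tilde = torch.tanh(self.W(fact) + self.U(r * h_prev))
        return g * h_tilde + (1.0 - g) * h_prev


# Toy usage: sweep the attention GRU over a sequence of encoded facts.
batch, n_facts, dim = 2, 5, 8
facts = torch.randn(batch, n_facts, dim)
gates = torch.softmax(torch.randn(batch, n_facts), dim=1)  # stand-in attention weights

cell = AttnGRUCell(dim, dim)
h = torch.zeros(batch, dim)
for i in range(n_facts):
    h = cell(facts[:, i], h, gates[:, i:i + 1])
print(h.shape)  # torch.Size([2, 8]) -- the episode summary vector
```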

Task | Dataset | Model | Metric | Value | Global Rank
Visual Question Answering (VQA) | COCO Visual Question Answering (VQA) real images 1.0, open-ended | DMN+ | Percentage correct | 60.4 | #8
Visual Question Answering (VQA) | VQA v1 test-dev | DMN+ | Accuracy | 60.3 | #6
Visual Question Answering (VQA) | VQA v1 test-std | DMN+ | Accuracy | 60.4 | #4
