TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Question Answering	CNN / Daily Mail	BiDAF	CNN	76.9	# 4
Question Answering	CNN / Daily Mail	BiDAF	Daily Mail	79.6	# 2
Question Answering	MS MARCO	BiDaF Baseline	Rouge-L	23.96	# 4
Question Answering	MS MARCO	BiDaF Baseline	BLEU-1	10.64	# 4
Question Answering	NarrativeQA	BiDAF	BLEU-1	33.45	# 8
Question Answering	NarrativeQA	BiDAF	BLEU-4	15.69	# 7
Question Answering	NarrativeQA	BiDAF	METEOR	15.68	# 7
Question Answering	NarrativeQA	BiDAF	Rouge-L	36.74	# 8
Open-Domain Question Answering	Quasar	BiDAF	EM (Quasar-T)	25.9	# 6
Open-Domain Question Answering	Quasar	BiDAF	F1 (Quasar-T)	28.5	# 5
Question Answering	SQuAD1.1	BiDAF (single model)	EM	67.974	# 170
Question Answering	SQuAD1.1	BiDAF (single model)	F1	77.323	# 175
Question Answering	SQuAD1.1	BiDAF (ensemble)	EM	73.744	# 134
Question Answering	SQuAD1.1	BiDAF (ensemble)	F1	81.525	# 142
Question Answering	SQuAD1.1 dev	BIDAF (single)	EM	67.7	# 41
Question Answering	SQuAD1.1 dev	BIDAF (single)	F1	77.3	# 44
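
The EM and F1 columns in the SQuAD rows are not defined on this page. As a rough guide, here is a minimal sketch of how exact match and token-overlap F1 are commonly computed for a single prediction, assuming SQuAD-style answer normalization (lowercasing, stripping punctuation and the articles "a", "an", "the"). The function names are illustrative, and leaderboard figures are typically averages over all questions expressed as percentages.

```python
import re
import string
from collections import Counter

PUNCTUATION = set(string.punctuation)

def normalize(text):
    """Lowercase, drop punctuation and articles, collapse whitespace (assumed convention)."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in PUNCTUATION)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())

def exact_match(prediction, truth):
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(truth))

def f1_score(prediction, truth):
    """Harmonic mean of token precision and recall after normalization."""
    pred_tokens = normalize(prediction).split()
    truth_tokens = normalize(truth).split()
    # Multiset intersection counts each shared token at most as often as it appears in both.
    overlap = sum((Counter(pred_tokens) & Counter(truth_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(truth_tokens)
    return 2 * precision * recall / (precision + recall)

print(exact_match("The Eiffel Tower", "eiffel tower."))  # 1.0
print(f1_score("in Paris France", "Paris"))              # 0.5
```

EM rewards only fully correct spans, while F1 gives partial credit for overlapping tokens, which is why the F1 values in the table are consistently higher than the EM values.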

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/bidirectional-attention-flow-for-machine/question-answering-on-cnn-daily-mail)](https://paperswithcode.com/sota/question-answering-on-cnn-daily-mail?p=bidirectional-attention-flow-for-machine)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/bidirectional-attention-flow-for-machine/question-answering-on-ms-marco)](https://paperswithcode.com/sota/question-answering-on-ms-marco?p=bidirectional-attention-flow-for-machine)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/bidirectional-attention-flow-for-machine/open-domain-question-answering-on-quasar)](https://paperswithcode.com/sota/open-domain-question-answering-on-quasar?p=bidirectional-attention-flow-for-machine)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/bidirectional-attention-flow-for-machine/question-answering-on-narrativeqa)](https://paperswithcode.com/sota/question-answering-on-narrativeqa?p=bidirectional-attention-flow-for-machine)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/bidirectional-attention-flow-for-machine/question-answering-on-squad11-dev)](https://paperswithcode.com/sota/question-answering-on-squad11-dev?p=bidirectional-attention-flow-for-machine)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/bidirectional-attention-flow-for-machine/question-answering-on-squad11)](https://paperswithcode.com/sota/question-answering-on-squad11?p=bidirectional-attention-flow-for-machine)`

Bidirectional Attention Flow for Machine Comprehension

5 Nov 2016 · Minjoon Seo, Aniruddha Kembhavi, Ali Farhadi, Hannaneh Hajishirzi

Machine comprehension (MC), answering a query about a given context paragraph, requires modeling complex interactions between the context and the query. Recently, attention mechanisms have been successfully extended to MC. Typically these methods use attention to focus on a small portion of the context and summarize it with a fixed-size vector, couple attentions temporally, and/or often form a uni-directional attention. In this paper we introduce the Bi-Directional Attention Flow (BIDAF) network, a multi-stage hierarchical process that represents the context at different levels of granularity and uses a bi-directional attention flow mechanism to obtain a query-aware context representation without early summarization. Our experimental evaluations show that our model achieves state-of-the-art results on the Stanford Question Answering Dataset (SQuAD) and the CNN/DailyMail cloze test.
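
The abstract does not spell out the attention computation, so the following is only a minimal sketch of how attention can flow in both directions between context and query while keeping one vector per context word (no early summarization). It assumes a plain dot-product similarity where a trained model would use a learned scoring function. The function names, the toy vectors, and the way the attended vectors are concatenated are illustrative assumptions, not details taken from this page.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def weighted_sum(weights, vectors):
    """Sum of vectors scaled by their weights."""
    dim = len(vectors[0])
    return [sum(w * v[k] for w, v in zip(weights, vectors)) for k in range(dim)]

def bidirectional_attention(H, U):
    """H: T context vectors, U: J query vectors (lists of equal-length lists).

    Returns T query-aware vectors, one per context word.
    """
    # Similarity between every context word and every query word.
    # Assumption: dot product stands in for a learned similarity function.
    S = [[dot(h, u) for u in U] for h in H]

    # Context-to-query: each context word attends over the query words.
    c2q = [weighted_sum(softmax(row), U) for row in S]

    # Query-to-context: weight context words by their best match to any query word.
    b = softmax([max(row) for row in S])
    q2c = weighted_sum(b, H)  # one vector, shared across all context positions

    # Keep a vector for every context position instead of collapsing to one summary.
    G = []
    for h, u_att in zip(H, c2q):
        G.append(
            h
            + u_att
            + [x * y for x, y in zip(h, u_att)]
            + [x * y for x, y in zip(h, q2c)]
        )
    return G

H = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # 3 context words, dimension 2
U = [[1.0, 0.0], [0.0, 1.0]]              # 2 query words, dimension 2
G = bidirectional_attention(H, U)
print(len(G), len(G[0]))  # 3 8
```

The output keeps the context length (3 rows here) and widens each row to four times the input dimension, so later layers still see every context position rather than a single fixed-size summary.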

Code

allenai/bi-att-flow official

1,525

baidu/DuReader

1,102

galsang/BiDAF-pytorch

243

davidgolub/QuestionGeneration

110

allenai/allennlp-reading-comprehens…

See all 26 implementations

Tasks

Cloze Test

Open-Domain Question Answering

Question Answering

Reading Comprehension

Datasets

SQuAD

Visual Question Answering

MS MARCO

CNN/Daily Mail

NarrativeQA

QUASAR-T

QUASAR

Results from the Paper

Ranked #4 on Question Answering on MS MARCO
