Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/qanet-combining-local-convolution-with-global/question-answering-on-squad11-dev)](https://paperswithcode.com/sota/question-answering-on-squad11-dev?p=qanet-combining-local-convolution-with-global)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/qanet-combining-local-convolution-with-global/question-answering-on-squad11)](https://paperswithcode.com/sota/question-answering-on-squad11?p=qanet-combining-local-convolution-with-global)`

QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension

ICLR 2018 · Adams Wei Yu, David Dohan, Minh-Thang Luong, Rui Zhao, Kai Chen, Mohammad Norouzi, Quoc V. Le

Current end-to-end machine reading and question answering (Q&A) models are primarily based on recurrent neural networks (RNNs) with attention. Despite their success, these models are often slow for both training and inference due to the sequential nature of RNNs. We propose a new Q&A architecture called QANet, which does not require recurrent networks: its encoder consists exclusively of convolution and self-attention, where convolution models local interactions and self-attention models global interactions. On the SQuAD dataset, our model is 3x to 13x faster in training and 4x to 9x faster in inference, while achieving equivalent accuracy to recurrent models. The speed-up gain allows us to train the model with much more data. We hence combine our model with data generated by backtranslation from a neural machine translation model. On the SQuAD dataset, our single model, trained with augmented data, achieves 84.6 F1 score on the test set, which is significantly better than the best published F1 score of 81.8.
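The split between local and global interactions described in the abstract can be illustrated with a toy example. The sketch below is not the authors' implementation: it uses a fixed smoothing kernel instead of learned convolution filters, a single unparameterized attention head, and it omits the normalization, feed-forward layers and position information a real model would have. The function names `conv1d_same`, `self_attention` and `encoder_block` are hypothetical and chosen only for this illustration.

```python
import math


def conv1d_same(seq, kernel):
    """Slide a 1-D kernel over the sequence; each output mixes only nearby positions."""
    half = len(kernel) // 2
    dim = len(seq[0])
    out = []
    for i in range(len(seq)):
        vec = [0.0] * dim
        for k, w in enumerate(kernel):
            j = i + k - half
            if 0 <= j < len(seq):  # positions outside the sequence act as zero padding
                for d in range(dim):
                    vec[d] += w * seq[j][d]
        out.append(vec)
    return out


def self_attention(seq):
    """Scaled dot-product attention with queries = keys = values = seq (no learned weights)."""
    dim = len(seq[0])
    out = []
    for q in seq:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(dim) for k in seq]
        top = max(scores)  # subtract the maximum for a numerically stable softmax
        weights = [math.exp(s - top) for s in scores]
        total = sum(weights)
        weights = [w / total for w in weights]
        out.append([sum(w * v[d] for w, v in zip(weights, seq)) for d in range(dim)])
    return out


def encoder_block(seq, kernel=(0.25, 0.5, 0.25)):
    """Toy block: convolution, then self-attention, each added back as a residual."""
    local = conv1d_same(seq, kernel)
    mixed = [[x + y for x, y in zip(a, b)] for a, b in zip(seq, local)]
    attended = self_attention(mixed)
    return [[x + y for x, y in zip(a, b)] for a, b in zip(mixed, attended)]
```

Feeding in a sequence whose only non-zero vector sits at the first position makes the difference visible: the convolution changes only the first two positions, while self-attention changes every position, because each output is a weighted average over the whole sequence. Neither operation depends on the result at the previous position, which is what allows all positions to be processed in parallel, unlike in an RNN.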

Code

BangLiu/QANet-PyTorch

120

andy840314/QANet-pytorch-

allenai/allennlp-reading-comprehens…

ewrfcas/QANet_keras

ni9elf/QANet

See all 15 implementations

Tasks

Machine Translation

Question Answering

Reading Comprehension

Translation

Datasets

SQuAD

TriviaQA

Results from the Paper

Ranked #27 on Question Answering on SQuAD1.1 dev

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Question Answering	SQuAD1.1	QANet + data augmentation ×3	EM	76.2	# 111
Question Answering	SQuAD1.1	QANet + data augmentation ×3	F1	84.6	# 107
Question Answering	SQuAD1.1 dev	QANet (data aug x3)	EM	75.1	# 27
Question Answering	SQuAD1.1 dev	QANet (data aug x3)	F1	83.8	# 29
Question Answering	SQuAD1.1 dev	QANet (data aug x2)	EM	74.5	# 28
Question Answering	SQuAD1.1 dev	QANet (data aug x2)	F1	83.2	# 31
Question Answering	SQuAD1.1 dev	QANet	EM	73.6	# 30
Question Answering	SQuAD1.1 dev	QANet	F1	82.7	# 33
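
For readers unfamiliar with the two metrics in the table, the following is a minimal sketch of how exact match (EM) and token-level F1 are commonly computed for SQuAD-style answers. It is an illustration and not the official evaluation script: the helper names `normalize`, `exact_match` and `f1_score` are assumptions made for this example, and it scores a single prediction against a single reference answer. Leaderboard figures are percentages averaged over the whole dataset, and the official scoring takes the best score over several reference answers per question.

```python
import re
import string
from collections import Counter


def normalize(text):
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, reference):
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def f1_score(prediction, reference):
    """Harmonic mean of token precision and recall after normalization."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    # Multiset intersection counts each shared token at most as often as it appears in both.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

Under this sketch, a prediction of "in Paris France" against the reference "Paris" scores 0 on EM but 0.5 on F1 (precision 1/3, recall 1), which is why F1 values in the table are consistently higher than EM values for the same model.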

Methods

Convolution
