TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Machine Translation	IWSLT2015 German-English	Conv-LSTM (deep+pos)	BLEU score	30.4	# 7
Machine Translation	WMT2014 English-French	Deep Convolutional Encoder; single-layer decoder	BLEU score	35.7	# 45
Machine Translation	WMT2016 English-Romanian	BiLSTM	BLEU score	27.5	# 15
Machine Translation	WMT2016 English-Romanian	Deep Convolutional Encoder; single-layer decoder	BLEU score	27.8	# 14

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-convolutional-encoder-model-for-neural/machine-translation-on-iwslt2015-german)](https://paperswithcode.com/sota/machine-translation-on-iwslt2015-german?p=a-convolutional-encoder-model-for-neural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-convolutional-encoder-model-for-neural/machine-translation-on-wmt2016-english-1)](https://paperswithcode.com/sota/machine-translation-on-wmt2016-english-1?p=a-convolutional-encoder-model-for-neural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-convolutional-encoder-model-for-neural/machine-translation-on-wmt2014-english-french)](https://paperswithcode.com/sota/machine-translation-on-wmt2014-english-french?p=a-convolutional-encoder-model-for-neural)`

A Convolutional Encoder Model for Neural Machine Translation

ACL 2017 · Jonas Gehring, Michael Auli, David Grangier, Yann N. Dauphin ·

The prevalent approach to neural machine translation relies on bi-directional LSTMs to encode the source sentence. In this paper we present a faster and simpler architecture based on a succession of convolutional layers. This allows to encode the entire source sentence simultaneously compared to recurrent networks for which computation is constrained by temporal dependencies. On WMT'16 English-Romanian translation we achieve competitive accuracy to the state-of-the-art and we outperform several recently published results on the WMT'15 English-German task. Our models obtain almost the same accuracy as a very deep LSTM setup on WMT'14 English-French translation. Our convolutional encoder speeds up CPU decoding by more than two times at the same or higher accuracy as a strong bi-directional LSTM baseline.

PDF Abstract ACL 2017 PDF ACL 2017 Abstract

Code

Add Remove Mark official

facebookresearch/fairseq

29,224

siyuofzhou/CNNSeqToSeq

Tasks

Add Remove

Machine Translation

Sentence

Translation

Datasets

WMT 2014

WMT 2016

WMT 2016 News

Results from the Paper

Add Remove

Ranked #7 on Machine Translation on IWSLT2015 German-English

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Machine Translation	IWSLT2015 German-English	Conv-LSTM (deep+pos)	BLEU score	30.4	# 7	Compare
Machine Translation	WMT2014 English-French	Deep Convolutional Encoder; single-layer decoder	BLEU score	35.7	# 45	Compare
Machine Translation	WMT2016 English-Romanian	BiLSTM	BLEU score	27.5	# 15	Compare
Machine Translation	WMT2016 English-Romanian	Deep Convolutional Encoder; single-layer decoder	BLEU score	27.8	# 14	Compare

Methods

Add Remove

LSTM • Sigmoid Activation • Tanh Activation

Edit Social Preview

A Convolutional Encoder Model for Neural Machine Translation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove