TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Machine Translation	WMT2014 English-French	Noisy back-translation	BLEU score	45.6	# 2
Machine Translation	WMT2014 English-French	Noisy back-translation	SacreBLEU	43.8	# 2
Machine Translation	WMT2014 English-French	Noisy back-translation	Hardware Burden	180G	# 1
Machine Translation	WMT2014 English-French	Noisy back-translation	Operations per network pass	None	# 1
Machine Translation	WMT2014 English-German	Noisy back-translation	BLEU score	35.0	# 2
Machine Translation	WMT2014 English-German	Noisy back-translation	SacreBLEU	33.8	# 1
Machine Translation	WMT2014 English-German	Noisy back-translation	Hardware Burden	146G	# 1
Machine Translation	WMT2014 English-German	Noisy back-translation	Operations per network pass	None	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/understanding-back-translation-at-scale/machine-translation-on-wmt2014-english-french)](https://paperswithcode.com/sota/machine-translation-on-wmt2014-english-french?p=understanding-back-translation-at-scale)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/understanding-back-translation-at-scale/machine-translation-on-wmt2014-english-german)](https://paperswithcode.com/sota/machine-translation-on-wmt2014-english-german?p=understanding-back-translation-at-scale)`

Understanding Back-Translation at Scale

EMNLP 2018 · Sergey Edunov, Myle Ott, Michael Auli, David Grangier

An effective method to improve neural machine translation with monolingual data is to augment the parallel training corpus with back-translations of target language sentences. This work broadens the understanding of back-translation and investigates a number of methods to generate synthetic source sentences. We find that in all but resource poor settings back-translations obtained via sampling or noised beam outputs are most effective. Our analysis shows that sampling or noisy synthetic data gives a much stronger training signal than data generated by beam or greedy search. We also compare how synthetic data compares to genuine bitext and study various domain effects. Finally, we scale to hundreds of millions of monolingual sentences and achieve a new state of the art of 35 BLEU on the WMT'14 English-German test set.
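The "noised beam outputs" mentioned in the abstract can be illustrated with a small sketch. The paper describes three perturbations applied to beam-search back-translations: deleting words, replacing words with a filler token, and shuffling words locally. The code below is not the fairseq implementation. The function name `noise_sentence`, the `<BLANK>` token string, and the default values (deletion probability 0.1, replacement probability 0.1, maximum displacement of 3 positions) are assumptions based on the paper's description and do not appear in the summary above.

```python
import random


def noise_sentence(tokens, p_drop=0.1, p_blank=0.1, max_shift=3,
                   blank="<BLANK>", rng=None):
    """Return a noised copy of a tokenized synthetic source sentence.

    Hypothetical helper for illustration only; the parameter names and
    defaults are assumptions, not the authors' code.
    """
    rng = rng or random.Random()

    # 1) Word deletion: drop each token independently with probability p_drop.
    kept = [t for t in tokens if rng.random() >= p_drop]

    # 2) Word blanking: replace each surviving token with a filler token
    #    with probability p_blank.
    blanked = [blank if rng.random() < p_blank else t for t in kept]

    # 3) Local shuffle: add uniform noise in [0, max_shift + 1] to each
    #    position and sort by the noisy keys. With a stable sort, no token
    #    ends up more than max_shift positions from where it started.
    keys = [i + rng.uniform(0, max_shift + 1) for i in range(len(blanked))]
    order = sorted(range(len(blanked)), key=keys.__getitem__)
    return [blanked[i] for i in order]


if __name__ == "__main__":
    sentence = "the cat sat on the mat and looked out of the window".split()
    print(noise_sentence(sentence, rng=random.Random(0)))
```

In a back-translation pipeline, a function like this would be applied to the synthetic source side only, leaving the genuine target sentence untouched, so that the forward model trains on pairs of the form (noised source, clean target).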

Code

pytorch/fairseq (official), 29,183 stars

facebookresearch/fairseq, 29,185 stars

valentinmace/noisy-text

Tasks

Machine Translation

Translation

Datasets

WMT 2014

Europarl

Results from the Paper

Ranked #2 on Machine Translation on WMT2014 English-German (using extra training data)
