Unsupervised Statistical Machine Translation

EMNLP 2018 · Mikel Artetxe, Gorka Labaka, Eneko Agirre

While modern machine translation has relied on large parallel corpora, a recent line of work has managed to train Neural Machine Translation (NMT) systems from monolingual corpora only (Artetxe et al., 2018c; Lample et al., 2018). Despite the potential of this approach for low-resource settings, existing systems are far behind their supervised counterparts, limiting their practical interest...

TASK | DATASET | MODEL | METRIC NAME | METRIC VALUE | GLOBAL RANK
Machine Translation | WMT2014 English-French | SMT + iterative backtranslation (unsupervised) | BLEU score | 26.22 | # 32
Machine Translation | WMT2014 English-German | SMT + iterative backtranslation (unsupervised) | BLEU score | 14.08 | # 44
Machine Translation | WMT2014 French-English | SMT + iterative backtranslation (unsupervised) | BLEU score | 25.87 | # 1
Unsupervised Machine Translation | WMT2014 French-English | SMT | BLEU | 25.9 | # 7
Machine Translation | WMT2014 German-English | SMT + iterative backtranslation (unsupervised) | BLEU score | 17.43 | # 8
Machine Translation | WMT2016 English-German | SMT + iterative backtranslation (unsupervised) | BLEU score | 18.23 | # 7
Machine Translation | WMT2016 German-English | SMT + iterative backtranslation (unsupervised) | BLEU score | 23.05 | # 3

Methods used in the Paper