Levenshtein Transformer

NeurIPS 2019 · Jiatao Gu, Changhan Wang, Jake Zhao

Modern neural sequence generation models are built either to generate tokens step-by-step from scratch or to (iteratively) modify a sequence of tokens of fixed length. In this work, we develop the Levenshtein Transformer, a new partially autoregressive model devised for more flexible and amenable sequence generation. Unlike previous approaches, the atomic operations of our model are insertion and deletion. Their combination facilitates not only generation but also sequence refinement, allowing dynamic length changes. We also propose a set of new training techniques dedicated to them, effectively exploiting one as the other's learning signal thanks to their complementary nature. In experiments, the proposed model achieves comparable performance with much-improved efficiency on both generation tasks (e.g. machine translation, text summarization) and refinement tasks (e.g. automatic post-editing). We further confirm the flexibility of our model by showing that a Levenshtein Transformer trained for machine translation can be used directly for automatic post-editing.
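The abstract's description of insertion and deletion as atomic operations suggests an iterative refinement loop at decoding time. The sketch below is only an illustration of that idea, not the authors' fairseq implementation: the callables `delete_head`, `insert_head`, and `token_head` are hypothetical stand-ins for the model's three prediction heads (token deletion, placeholder insertion, and token filling), and the interfaces are assumptions made for this example.

```python
"""Minimal sketch of an insert/delete refinement loop in the spirit of the
Levenshtein Transformer. Policy callables are hypothetical stand-ins for the
learned prediction heads; this is not the authors' implementation."""
from typing import Callable, List, Optional

BOS, EOS, PLH = "<s>", "</s>", "<plh>"

def refine(
    src: List[str],
    delete_head: Callable[[List[str], List[str]], List[bool]],  # keep-mask, one bool per token
    insert_head: Callable[[List[str], List[str]], List[int]],   # placeholder count, one per gap
    token_head: Callable[[List[str], List[str]], List[str]],    # one word per <plh> in the canvas
    init: Optional[List[str]] = None,
    max_iters: int = 10,
) -> List[str]:
    # Start from an empty canvas (generation) or an existing draft (refinement).
    hyp = list(init) if init is not None else [BOS, EOS]
    for _ in range(max_iters):
        prev = list(hyp)
        # 1) Deletion: drop tokens flagged by the deletion head (boundaries are protected).
        keep = delete_head(src, hyp)
        hyp = [t for t, k in zip(hyp, keep) if k or t in (BOS, EOS)]
        # 2) Placeholder insertion: open empty slots in each gap between adjacent tokens.
        counts = insert_head(src, hyp)
        canvas: List[str] = [hyp[0]]
        for gap, tok in enumerate(hyp[1:]):
            canvas += [PLH] * counts[gap]
            canvas.append(tok)
        # 3) Token prediction: fill all placeholders in parallel.
        fills = iter(token_head(src, canvas))
        hyp = [next(fills) if t == PLH else t for t in canvas]
        # Stop once a full refinement pass changes nothing.
        if hyp == prev:
            break
    return hyp

# Toy usage with trivial hand-written policies (no learned model):
out = refine(
    src=["ein", "Beispiel"],
    delete_head=lambda s, h: [True] * len(h),
    insert_head=lambda s, h: ([2] if len(h) == 2 else [0]) + [0] * (len(h) - 2),
    token_head=lambda s, c: ["an", "example"],
)
print(out)  # ['<s>', 'an', 'example', '</s>']
```

In this reading, generation and refinement differ only in the starting canvas, which is why a single model can serve both machine translation and post-editing; the actual heads are classifiers over decoder states rather than the hand-written lambdas shown here.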

| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Machine Translation | WMT2014 English-German | Levenshtein Transformer (distillation) | BLEU score | 27.27 | #53 |
| Machine Translation | WMT2016 Romanian-English | Levenshtein Transformer (distillation) | BLEU score | 33.26 | #6 |