Neural Machine Translation in Linear Time

31 Oct 2016Nal KalchbrennerLasse EspeholtKaren SimonyanAaron van den OordAlex GravesKoray Kavukcuoglu

We present a novel neural network for processing sequences. The ByteNet is a one-dimensional convolutional neural network that is composed of two parts, one to encode the source sequence and the other to decode the target sequence... (read more)

PDF Abstract
TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK RESULT BENCHMARK
Language Modelling Hutter Prize Bytenet decoder (Kalchbrenner et al., 2016) Bit per Character (BPC) 1.31 # 11
Machine Translation WMT2014 English-French ByteNet BLEU score 23.8 # 34
Machine Translation WMT2014 English-German ByteNet BLEU score 23.75 # 30
Machine Translation WMT2015 English-German ByteNet BLEU score 26.3 # 1

Methods used in the Paper


METHOD TYPE
🤖 No Methods Found Help the community by adding them if they're not listed; e.g. Deep Residual Learning for Image Recognition uses ResNet