no code implementations • 29 Oct 2018 • Heng xin Fun, Sergiy V Bokhnyak, Francesco Saverio Zuppichini
In this paper we examine a possible reason for the LSTM outperforming the GRU on language modeling and more specifically machine translation.
Language Modelling Machine Translation +1