no code implementations • 27 Aug 2019 • Dokook Choe, Rami Al-Rfou, Mandy Guo, Heeyoung Lee, Noah Constant
Purely character-based language models (LMs) have been lagging in quality on large scale datasets, and current state-of-the-art LMs rely on word tokenization.
1 code implementation • 9 Aug 2018 • Rami Al-Rfou, Dokook Choe, Noah Constant, Mandy Guo, Llion Jones
LSTMs and other RNN variants have shown strong performance on character-level language modeling.
Ranked #8 on Language Modelling on Hutter Prize