Recurrent Neural Network Regularization

8 Sep 2014 · Wojciech Zaremba, Ilya Sutskever, Oriol Vinyals

We present a simple regularization technique for Recurrent Neural Networks (RNNs) with Long Short-Term Memory (LSTM) units. Dropout, the most successful technique for regularizing neural networks, does not work well with RNNs and LSTMs. In this paper, we show how to correctly apply dropout to LSTMs, and show that it substantially reduces overfitting on a variety of tasks. These tasks include language modeling, speech recognition, image caption generation, and machine translation.
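The recipe the paper describes is to apply dropout only to the non-recurrent connections of a multi-layer LSTM (embedding to first layer, between layers, and before the output projection), while leaving the recurrent hidden-to-hidden path untouched. Below is a minimal PyTorch sketch of that scheme; the class name DropoutLSTM, the two-layer structure, and the 650-unit / 0.5-dropout sizes (roughly the paper's "medium" setting) are illustrative choices, not code from the paper.

import torch
import torch.nn as nn

class DropoutLSTM(nn.Module):
    """Two-layer word-level LSTM with dropout applied only to the
    non-recurrent connections (embedding -> layer 1, layer 1 -> layer 2,
    layer 2 -> output projection), never to the recurrent
    hidden-to-hidden path."""

    def __init__(self, vocab_size, hidden_size=650, dropout=0.5):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden_size)
        self.cell1 = nn.LSTMCell(hidden_size, hidden_size)
        self.cell2 = nn.LSTMCell(hidden_size, hidden_size)
        self.drop = nn.Dropout(dropout)      # reused on every non-recurrent link
        self.decoder = nn.Linear(hidden_size, vocab_size)

    def forward(self, tokens, state1=None, state2=None):
        # tokens: LongTensor of shape (seq_len, batch) holding word indices
        logits = []
        for x_t in self.embed(tokens):       # one (batch, hidden) slice per time step
            h1, c1 = self.cell1(self.drop(x_t), state1)   # dropout on the layer input
            state1 = (h1, c1)                # recurrent state flows through un-dropped
            h2, c2 = self.cell2(self.drop(h1), state2)    # dropout between the two layers
            state2 = (h2, c2)
            logits.append(self.decoder(self.drop(h2)))    # dropout before the output projection
        return torch.stack(logits), state1, state2

As with standard dropout, the masks are active only in training mode (model.train()); calling model.eval() disables them at evaluation time.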

Task                | Dataset                    | Model                                | Metric                | Value | Global Rank
Language Modelling  | Penn Treebank (Word Level) | Zaremba et al. (2014) - LSTM (large)  | Validation perplexity | 82.2  | #30
Language Modelling  | Penn Treebank (Word Level) | Zaremba et al. (2014) - LSTM (large)  | Test perplexity       | 78.4  | #36
Language Modelling  | Penn Treebank (Word Level) | Zaremba et al. (2014) - LSTM (medium) | Validation perplexity | 86.2  | #31
Language Modelling  | Penn Treebank (Word Level) | Zaremba et al. (2014) - LSTM (medium) | Test perplexity       | 82.7  | #39
Machine Translation | WMT2014 English-French     | Regularized LSTM                      | BLEU score            | 29.03 | #50
