Recurrent Neural Networks

Pointer Sentinel-LSTM

Introduced by Merity et al. in Pointer Sentinel Mixture Models

The Pointer Sentinel-LSTM mixture model is a type of recurrent neural network that combines the advantages of standard softmax classifiers with those of a pointer component for effective and efficient language modeling. Rather than relying on the RNN hidden state to decide when to use the pointer, the model allows the pointer component itself to decide when to use the softmax vocabulary through a sentinel.

Source: Pointer Sentinel Mixture Models

Papers


Paper Code Results Date Stars

Tasks


Task Papers Share
Language Modelling 1 100.00%

Categories