Recurrent Highway Networks with Grouped Auxiliary Memory

IEEE Access 2019 · Wei Luo, Feng Yu

Recurrent neural networks (RNNs) are challenging to train, let alone those with deep spatial structures. Architectures built upon highway connections, such as the Recurrent Highway Network (RHN), were developed to allow greater step-to-step transition depth, leading to more expressive models. However, problems that require capturing long-term dependencies still cannot be well addressed by these models. Moreover, the ability to retain long-term memories tends to diminish as spatial depth increases, since deeper structures can exacerbate vanishing gradients. In this paper, we address these issues by proposing a novel RNN architecture based on the RHN, namely the Recurrent Highway Network with Grouped Auxiliary Memory (GAM-RHN). The proposed architecture interconnects the RHN with a set of auxiliary memory units dedicated to storing long-term information via read and write operations, analogous to Memory-Augmented Neural Networks (MANNs). Experimental results on artificial long time lag tasks show that GAM-RHNs can be trained efficiently while being deep in both time and space. We also evaluate the proposed architecture on a variety of tasks, including language modeling, sequential image classification, and financial market forecasting. The potential of our approach is demonstrated by achieving state-of-the-art results on these tasks.
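To make the idea concrete, below is a minimal PyTorch sketch of a single recurrent step in this style: a highway transition whose carried state is augmented by a content-based read from an external memory, followed by a soft write back into that memory. This is an illustrative simplification under assumed details; the class and parameter names (GAMRHNCellSketch, read_key, write_val, etc.) are hypothetical, and the paper's exact gating equations, memory grouping, and multi-layer transition depth are not reproduced here.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GAMRHNCellSketch(nn.Module):
    """Illustrative sketch (not the paper's exact method): one highway
    recurrence step coupled to an external memory via MANN-style
    content-based read/write attention."""

    def __init__(self, input_size, hidden_size, num_slots, slot_size):
        super().__init__()
        # Highway candidate transform H and transfer gate T
        # (a single transition layer is shown for brevity).
        self.h_lin = nn.Linear(input_size + hidden_size, hidden_size)
        self.t_lin = nn.Linear(input_size + hidden_size, hidden_size)
        # Projections for addressing and updating the memory slots.
        self.read_key = nn.Linear(hidden_size, slot_size)
        self.write_key = nn.Linear(hidden_size, slot_size)
        self.write_val = nn.Linear(hidden_size, slot_size)
        self.read_proj = nn.Linear(slot_size, hidden_size)
        self.num_slots, self.slot_size = num_slots, slot_size

    def forward(self, x, state, memory):
        # x: (B, input), state: (B, hidden), memory: (B, slots, slot_size)
        # Read: soft attention over slots by key similarity.
        rk = self.read_key(state)                               # (B, slot_size)
        attn = F.softmax(torch.bmm(memory, rk.unsqueeze(2)).squeeze(2), dim=1)
        read = torch.bmm(attn.unsqueeze(1), memory).squeeze(1)  # (B, slot_size)
        s = state + self.read_proj(read)    # inject long-term information
        # Highway transition: gate t interpolates candidate h and carry s,
        # preserving a direct path for gradients across depth.
        xs = torch.cat([x, s], dim=1)
        h = torch.tanh(self.h_lin(xs))
        t = torch.sigmoid(self.t_lin(xs))
        new_state = t * h + (1.0 - t) * s
        # Write: blend a new value into slots, weighted by write attention.
        wk = self.write_key(new_state)
        w_attn = F.softmax(torch.bmm(memory, wk.unsqueeze(2)).squeeze(2), dim=1)
        wv = self.write_val(new_state).unsqueeze(1)             # (B, 1, slot_size)
        memory = memory + w_attn.unsqueeze(2) * (wv - memory)
        return new_state, memory
```

The key design point this sketch illustrates is the separation of roles: the highway state handles short-range transitions with trainable depth, while the slot memory persists across time steps and is only touched through attention-weighted reads and writes, so long-term information need not survive every deep transition.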


Results from the Paper


Task                            | Dataset                         | Model        | Metric Name             | Metric Value | Global Rank
Stock Trend Prediction          | FI-2010                         | BL-GAM-RHN-7 | F1 (H50)                | 0.8088       | # 1
Stock Trend Prediction          | FI-2010                         | BL-GAM-RHN-7 | Accuracy (H50)          | 0.8202       | # 1
Language Modelling              | Penn Treebank (Character Level) | GAM-RHN-5    | Bit per Character (BPC) | 1.147        | # 3
Language Modelling              | Penn Treebank (Character Level) | GAM-RHN-5    | Number of params        | 16.0M        | # 6
Sequential Image Classification | Sequential MNIST                | GAM-RHN-1    | Permuted Accuracy       | 96.8%        | # 17
Language Modelling              | Text8                           | GAM-RHN-10   | Bit per Character (BPC) | 1.157        | # 12
Language Modelling              | Text8                           | GAM-RHN-10   | Number of params        | 44.7M        | # 11
