TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Prediction	KTH	PredRNN++	PSNR	28.47	# 6
Video Prediction	KTH	PredRNN++	SSIM	0.865	# 7
Video Prediction	KTH	PredRNN++	Cond	10	# 1
Video Prediction	KTH	PredRNN++	Pred	20	# 1
Video Prediction	Moving MNIST	Causal LSTM	MSE	46.5	# 25
Video Prediction	Moving MNIST	Causal LSTM	MAE	106.8	# 18
Video Prediction	Moving MNIST	Causal LSTM	SSIM	0.898	# 20
Video Prediction	SynpickVP	PredRNN++	MSE	51.73	# 1
Video Prediction	SynpickVP	PredRNN++	PSNR	27.50	# 2
Video Prediction	SynpickVP	PredRNN++	SSIM	0.894	# 1
Video Prediction	SynpickVP	PredRNN++	LPIPS	0.053	# 2
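
For readers unfamiliar with the pixel-level metrics in the table, the following is a minimal NumPy sketch of how MSE, MAE, and PSNR might be computed for a predicted clip. The function name `frame_metrics`, the `(T, H, W)` layout, and the choice to sum squared and absolute errors over pixels before averaging over frames are assumptions made for illustration; this page does not state which convention each benchmark uses. SSIM and LPIPS are omitted because they need a windowed or learned comparison.

```python
import numpy as np

def frame_metrics(pred, target, max_val=1.0):
    """Return (mse, mae, psnr) averaged over the frames of one clip.

    pred, target: arrays of shape (T, H, W) with values in [0, max_val].
    Assumption: MSE and MAE are summed over pixels within each frame,
    then averaged over frames. Other benchmarks may average per pixel.
    """
    pred = np.asarray(pred, dtype=np.float64)
    target = np.asarray(target, dtype=np.float64)
    err = pred - target

    # Per-frame sums over the spatial axes, then the mean over frames.
    mse = np.mean(np.sum(err ** 2, axis=(1, 2)))
    mae = np.mean(np.sum(np.abs(err), axis=(1, 2)))

    # PSNR uses the per-pixel mean squared error of each frame.
    per_pixel_mse = np.mean(err ** 2, axis=(1, 2))
    per_pixel_mse = np.maximum(per_pixel_mse, 1e-12)  # avoid log of zero
    psnr = np.mean(10.0 * np.log10(max_val ** 2 / per_pixel_mse))
    return mse, mae, psnr

# Toy clip: two 4x4 frames, prediction off by 0.1 at every pixel.
target = np.zeros((2, 4, 4))
pred = np.full((2, 4, 4), 0.1)
mse, mae, psnr = frame_metrics(pred, target)
```

Lower is better for MSE and MAE, and higher is better for PSNR.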

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/predrnn-towards-a-resolution-of-the-deep-in/video-prediction-on-kth)](https://paperswithcode.com/sota/video-prediction-on-kth?p=predrnn-towards-a-resolution-of-the-deep-in)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/predrnn-towards-a-resolution-of-the-deep-in/video-prediction-on-synpickvp)](https://paperswithcode.com/sota/video-prediction-on-synpickvp?p=predrnn-towards-a-resolution-of-the-deep-in)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/predrnn-towards-a-resolution-of-the-deep-in/video-prediction-on-moving-mnist)](https://paperswithcode.com/sota/video-prediction-on-moving-mnist?p=predrnn-towards-a-resolution-of-the-deep-in)`

PredRNN++: Towards A Resolution of the Deep-in-Time Dilemma in Spatiotemporal Predictive Learning

ICML 2018 · Yunbo Wang, Zhifeng Gao, Mingsheng Long, Jian-Min Wang, Philip S. Yu

We present PredRNN++, an improved recurrent network for video predictive learning. In pursuit of a greater spatiotemporal modeling capability, our approach increases the transition depth between adjacent states by leveraging a novel recurrent unit, which is named Causal LSTM for re-organizing the spatial and temporal memories in a cascaded mechanism. However, there is still a dilemma in video predictive learning: increasingly deep-in-time models have been designed for capturing complex variations, while introducing more difficulties in the gradient back-propagation. To alleviate this undesirable effect, we propose a Gradient Highway architecture, which provides alternative shorter routes for gradient flows from outputs back to long-range inputs. This architecture works seamlessly with causal LSTMs, enabling PredRNN++ to capture short-term and long-term dependencies adaptively. We assess our model on both synthetic and real video datasets, showing its ability to ease the vanishing gradient problem and yield state-of-the-art prediction results even in a difficult objects occlusion scenario.
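
The abstract describes the Gradient Highway only in words. As a rough illustration of the idea, the sketch below implements one step of a highway-style gated recurrence: a switch gate blends a freshly transformed input with the previous highway state, so part of the state (and its gradient) can pass through unchanged. The gated form used here is an assumed formulation, since the abstract gives no equations. Dense matrix products stand in for the convolutions a video model would use, and all names (`highway_step`, `W_px`, `W_pz`, `W_sx`, `W_sz`) are hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def highway_step(x_t, z_prev, W_px, W_pz, W_sx, W_sz):
    """One step of a highway-style recurrent unit (illustrative only).

    x_t:    input features at time t, shape (batch, d_in)
    z_prev: previous highway state,   shape (batch, d_hid)
    Returns the new state z_t and the switch gate s_t.
    """
    # Candidate state computed from the input and the previous state.
    p_t = np.tanh(x_t @ W_px + z_prev @ W_pz)
    # Switch gate in (0, 1): how much of the candidate to let in.
    s_t = sigmoid(x_t @ W_sx + z_prev @ W_sz)
    # Convex blend: the (1 - s_t) branch carries z_prev forward
    # untouched, which is the shorter route for gradients.
    z_t = s_t * p_t + (1.0 - s_t) * z_prev
    return z_t, s_t

# Toy usage with random weights (assumed sizes).
rng = np.random.default_rng(0)
batch, d_in, d_hid = 2, 3, 4
x_t = rng.normal(size=(batch, d_in))
z_prev = rng.normal(size=(batch, d_hid))
W_px, W_sx = rng.normal(size=(2, d_in, d_hid))
W_pz, W_sz = rng.normal(size=(2, d_hid, d_hid))
z_t, s_t = highway_step(x_t, z_prev, W_px, W_pz, W_sx, W_sz)
```

When the gate stays close to zero, `z_t` is nearly equal to `z_prev`, so information from distant time steps reaches the output with little attenuation.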

Code

Yunbo426/predrnn-pp (official): 245 stars

thuml/predrnn-pytorch: 401 stars

mindspore-ai/models: 219 stars

Flunzmas/vp-suite

dzhv/Spatio-Temporal-mobile-traffic…

See all 11 implementations

Tasks

Video Prediction

Datasets

MNIST

KTH

Moving MNIST

SynPick

Results from the Paper

Ranked #1 on Video Prediction on KTH (Cond metric)

Methods

LSTM • Sigmoid Activation • Tanh Activation
