Video Prediction

183 papers with code • 19 benchmarks • 24 datasets

Video Prediction is the task of predicting future frames given past video frames.

Gif credit: MAGVIT

Source: Photo-Realistic Video Prediction on Natural Videos of Largely Changing Frames

Benchmarks

Add a Result

These leaderboards are used to track progress in Video Prediction

Dataset	Best Model	Compare
KTH	Grid-keypoints	See all
Moving MNIST	SimVP+gSTA-Sx10	See all
Kinetics-600 12 frames, 64x64	MAGVIT-v2	See all
Human3.6M	IAM4VP	See all
BAIR Robot Pushing	MAGVIT (-L-FP)	See all
Cityscapes 128x128	GHVAEs	See all
SynpickVP	MSPred	See all
CMU Mocap-2	Latent SDE	See all
Cityscapes	DMVFN	See all
KITTI	DMVFN	See all
CMU Mocap-1	ODE2VAE-KL	See all
DAVIS 2017	DMVFN	See all
Vimeo90K	OPT	See all
Colored dSprites	MGP-VAE (with geodesic loss)	See all
Sprites	MGP-VAE (with geodesic loss)	See all
YouTube-8M	SDCNet	See all
KTH 64x64 cond10 pred30	SRVP	See all
Something-Something V2	MAGVIT	See all
MPI Sintel	MCnet [villegas2017mcnet]	See all

Show all 19 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Video Prediction models and implementations

chengtan9907/simvpv2

10 papers

566

chengtan9907/OpenSTL

4 papers

566

Flunzmas/vp-suite

3 papers

tensorflow/tensor2tensor

2 papers

14,864

See all 6 libraries.

Datasets

Subtasks

Most implemented papers

Most implemented Social Latest No code

Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting

ndrplz/ConvLSTM_pytorch • • NeurIPS 2015

The goal of precipitation nowcasting is to predict the future rainfall intensity in a local region over a relatively short period of time.

Paper
Code

Deep Predictive Coding Networks for Video Prediction and Unsupervised Learning

coxlab/prednet • • 25 May 2016

Here, we explore prediction of future frames in a video sequence as an unsupervised learning rule for learning about the structure of the visual world.

Paper
Code

PredRNN++: Towards A Resolution of the Deep-in-Time Dilemma in Spatiotemporal Predictive Learning

Yunbo426/predrnn-pp • • ICML 2018

We present PredRNN++, an improved recurrent network for video predictive learning.

Paper
Code

Video-to-Video Synthesis

NVIDIA/vid2vid • • NeurIPS 2018

We study the problem of video-to-video synthesis, whose goal is to learn a mapping function from an input source video (e. g., a sequence of semantic segmentation masks) to an output photorealistic video that precisely depicts the content of the source video.

Paper
Code

MogaNet: Multi-order Gated Aggregation Network

Westlake-AI/openmixup • • 7 Nov 2022

Notably, MogaNet hits 80. 0\% and 87. 8\% accuracy with 5. 2M and 181M parameters on ImageNet-1K, outperforming ParC-Net and ConvNeXt-L, while saving 59\% FLOPs and 17M parameters, respectively.

Paper
Code

Deep multi-scale video prediction beyond mean square error

dyelax/Adversarial_Video_Generation • • 17 Nov 2015

Learning to predict future images from a video sequence involves the construction of an internal representation that models the image evolution accurately, and therefore, to some degree, its content and dynamics.

Paper
Code

The "something something" video database for learning and evaluating visual common sense

jayleicn/singularity • • ICCV 2017

Neural networks trained on datasets such as ImageNet have led to major advances in visual object classification.

Paper
Code

Learning a Driving Simulator

commaai/research • • 3 Aug 2016

Comma. ai's approach to Artificial Intelligence for self-driving cars is based on an agent that learns to clone driver behaviors and plans maneuvers by simulating future events in the road.

Paper
Code

Deep Learning for Precipitation Nowcasting: A Benchmark and A New Model

Hzzone/Precipitation-Nowcasting • • NeurIPS 2017

To address these problems, we propose both a new model and a benchmark for precipitation nowcasting.

Paper
Code

Stochastic Adversarial Video Prediction

alexlee-gk/video_prediction • • ICLR 2019

However, learning to predict raw future observations, such as frames in a video, is exceedingly challenging -- the ambiguous nature of the problem can cause a naively designed model to average together possible futures into a single, blurry prediction.

Paper
Code

Video Prediction

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result