Video Prediction
183 papers with code • 19 benchmarks • 24 datasets
Video Prediction is the task of predicting future frames given past video frames.
Gif credit: MAGVIT
Source: Photo-Realistic Video Prediction on Natural Videos of Largely Changing Frames
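A common reference point for this task is the persistence baseline, which simply repeats the last observed frame; learned models are typically judged by how far they beat it on pixel-level error. A minimal sketch in Python (the array shapes and the toy clip are illustrative assumptions, not from any listed paper):

```python
import numpy as np

def persistence_predict(past_frames: np.ndarray, horizon: int) -> np.ndarray:
    """Repeat the last observed frame `horizon` times.

    past_frames: context clip of shape (T, H, W, C).
    Returns predicted frames of shape (horizon, H, W, C).
    """
    last = past_frames[-1]
    return np.repeat(last[None, ...], horizon, axis=0)

def per_frame_mse(pred: np.ndarray, truth: np.ndarray) -> np.ndarray:
    """Pixel-level mean squared error for each predicted frame."""
    return ((pred - truth) ** 2).mean(axis=(1, 2, 3))

# Toy clip: a single bright pixel moving diagonally on an 8x8 canvas.
clip = np.zeros((6, 8, 8, 1), dtype=np.float32)
for t in range(6):
    clip[t, t, t, 0] = 1.0

pred = persistence_predict(clip[:4], horizon=2)
errors = per_frame_mse(pred, clip[4:])
print(errors)  # nonzero: the pixel keeps moving, persistence does not
```

Any real video prediction model would replace `persistence_predict` while keeping the same contract: past frames in, a fixed horizon of future frames out.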
Libraries
Use these libraries to find Video Prediction models and implementations.
Datasets
Latest papers
SkyGPT: Probabilistic Short-term Solar Forecasting Using Synthetic Sky Videos from Physics-constrained VideoGPT
Furthermore, we feed the generated future sky images from the video prediction models for 15-minute-ahead probabilistic solar forecasting for a 30-kW roof-top PV system, and compare it with an end-to-end deep learning baseline model SUNSET and a smart persistence model.
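The smart persistence baseline mentioned above is commonly defined as holding the clear-sky index (the ratio of measured to clear-sky power) constant over the forecast horizon. A hedged sketch of that idea, with hypothetical power values standing in for real measurements:

```python
def smart_persistence(pv_now_kw: float,
                      clearsky_now_kw: float,
                      clearsky_future_kw: float) -> float:
    """Smart persistence forecast: assume the clear-sky index
    (measured power / clear-sky power) stays constant over the
    forecast horizon. All inputs are instantaneous power in kW;
    the specific numbers below are illustrative assumptions.
    """
    clearsky_index = pv_now_kw / clearsky_now_kw
    return clearsky_index * clearsky_future_kw

# Example: a rooftop system producing 18 kW under a 24-kW clear-sky
# ceiling; 15 minutes ahead the clear-sky ceiling falls to 20 kW.
print(smart_persistence(18.0, 24.0, 20.0))  # 15.0
```

This captures why the baseline is "smart": unlike plain persistence, it follows the deterministic diurnal change in irradiance and only assumes cloud conditions persist.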
Fast Fourier Inception Networks for Occluded Video Prediction
Video prediction is a pixel-level task that generates future frames from historical frames.
DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles
We propose a new object-centric video prediction algorithm based on the deep latent particle (DLP) representation.
Video Diffusion Models with Local-Global Context Guidance
We construct a local-global context guidance strategy to capture the multi-perceptual embedding of the past fragment to boost the consistency of future prediction.
Video Prediction Models as Rewards for Reinforcement Learning
A promising approach is to extract preferences for behaviors from unlabeled videos, which are widely available on the internet.
Let's Think Frame by Frame with VIP: A Video Infilling and Prediction Dataset for Evaluating Video Chain-of-Thought
Despite exciting recent results showing vision-language systems' capacity to reason about images using natural language, their capacity for video reasoning remains under-explored.
VDT: General-purpose Video Diffusion Transformers via Mask Modeling
We also propose a unified spatial-temporal mask modeling mechanism, seamlessly integrated with the model, to cater to diverse video generation scenarios.
PastNet: Introducing Physical Inductive Biases for Spatio-temporal Video Prediction
In this paper, we investigate the challenge of spatio-temporal video prediction, which involves generating future videos based on historical data streams.
A Control-Centric Benchmark for Video Prediction
Video is a promising source of knowledge for embodied agents to learn models of the world's dynamics.
Multi-modal learning for geospatial vegetation forecasting
Our study breaks new ground by introducing GreenEarthNet, the first dataset specifically designed for high-resolution vegetation forecasting, and Contextformer, a novel deep learning approach for predicting vegetation greenness at fine resolution across Europe from Sentinel-2 satellite images.