Video Prediction
183 papers with code • 19 benchmarks • 24 datasets
Video Prediction is the task of predicting future frames given past video frames.
Gif credit: MAGVIT
Source: Photo-Realistic Video Prediction on Natural Videos of Largely Changing Frames
Libraries
Use these libraries to find Video Prediction models and implementationsDatasets
Most implemented papers
PredNet and Predictive Coding: A Critical Review
We fill in the gap by evaluating PredNet both as an implementation of the predictive coding theory and as a self-supervised video prediction model using a challenging video action classification dataset.
Disentangling Physical Dynamics from Unknown Factors for Unsupervised Video Prediction
Leveraging physical knowledge described by partial differential equations (PDEs) is an appealing way to improve unsupervised video prediction methods.
PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive Learning
This paper models these structures by presenting PredRNN, a new recurrent network, in which a pair of memory cells are explicitly decoupled, operate in nearly independent transition manners, and finally form unified representations of the complex environment.
Video Diffusion Models
Generating temporally coherent high fidelity video is an important milestone in generative modeling research.
SimVP: Simpler yet Better Video Prediction
From CNN, RNN, to ViT, we have witnessed remarkable advancements in video prediction, incorporating auxiliary inputs, elaborate neural architectures, and sophisticated training strategies.
Video Prediction Models as Rewards for Reinforcement Learning
A promising approach is to extract preferences for behaviors from unlabeled videos, which are widely available on the internet.
Expert Gate: Lifelong Learning with a Network of Experts
Further, the autoencoders inherently capture the relatedness of one task to another, based on which the most relevant prior model to be used for training a new expert, with finetuning or learning without-forgetting, can be selected.
Predicting Deeper into the Future of Semantic Segmentation
The ability to predict and therefore to anticipate the future is an important attribute of intelligence.
Learning to Generate Long-term Future via Hierarchical Prediction
To avoid inherent compounding errors in recursive pixel-level prediction, we propose to first estimate high-level structure in the input frames, then predict how that structure evolves in the future, and finally by observing a single frame from the past and the predicted high-level structure, we construct the future frames without having to observe any of the pixel-level predictions.
Prediction Under Uncertainty with Error-Encoding Networks
In this work we introduce a new framework for performing temporal predictions in the presence of uncertainty.