Future prediction
39 papers with code • 0 benchmarks • 1 datasets
Benchmarks
These leaderboards are used to track progress in Future prediction
Latest papers
TrajPRed: Trajectory Prediction with Region-based Relation Learning
We integrate multi-goal estimation and region-based relation learning to model the two stimuli, social interactions, and stochastic goals, in a prediction framework.
MTD: Multi-Timestep Detector for Delayed Streaming Perception
Autonomous driving systems require real-time environmental perception to ensure user safety and experience.
Video Diffusion Models with Local-Global Context Guidance
We construct a local-global context guidance strategy to capture the multi-perceptual embedding of the past fragment to boost the consistency of future prediction.
Neural Foundations of Mental Simulation: Future Prediction of Latent Representations on Dynamic Scenes
In particular, we find that neural responses are currently best predicted by models trained to predict the future state of their environment in the latent space of pretrained foundation models optimized for dynamic scenes in a self-supervised manner.
Rethinking Learning Approaches for Long-Term Action Anticipation
Action anticipation involves predicting future actions having observed the initial portion of a video.
EgoTaskQA: Understanding Human Tasks in Egocentric Videos
The challenges of such capability lie in the difficulty of generating a detailed understanding of situated actions, their effects on object states (i. e., state changes), and their causal dependencies.
ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal Feature Learning
In particular, we propose a spatial-temporal feature learning scheme towards a set of more representative features for perception, prediction and planning tasks simultaneously, which is called ST-P3.
Graph-based Spatial Transformer with Memory Replay for Multi-future Pedestrian Trajectory Prediction
Pedestrian trajectory prediction is an essential and challenging task for a variety of real-life applications such as autonomous driving and robotic motion planning.
BEVerse: Unified Perception and Prediction in Birds-Eye-View for Vision-Centric Autonomous Driving
Specifically, BEVerse first performs shared feature extraction and lifting to generate 4D BEV representations from multi-timestamp and multi-view images.
Borrowing from yourself: Faster future video segmentation with partial channel update
Semantic segmentation is a well-addressed topic in the computer vision literature, but the design of fast and accurate video processing networks remains challenging.