Optical Flow Estimation

655 papers with code • 10 benchmarks • 34 datasets

Optical Flow Estimation is a computer vision task that involves computing the apparent motion of pixels or features between consecutive frames of a video or image sequence. The resulting motion field can be used for applications such as object tracking, motion analysis, and video compression.

Approaches for optical flow estimation include correlation-based, block-matching, feature-tracking, energy-based, and gradient-based methods and, more recently, learning-based methods built on deep neural networks.
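
As an illustration of the block-matching family mentioned above, here is a minimal sketch in plain Python. It is not taken from any of the listed libraries: the function name `block_matching_flow`, the helper `make_frame`, and the parameters `block` and `search` are hypothetical names chosen for this example. It assumes two equally sized grayscale frames stored as nested lists and a block that lies fully inside the first frame.

```python
def block_matching_flow(frame0, frame1, y, x, block=3, search=2):
    """Estimate the displacement (dy, dx) of the block centred at (y, x).

    frame0 and frame1 are 2-D lists of grayscale intensities of equal size.
    The result is the offset, within +/- `search` pixels, that minimises the
    sum of absolute differences (SAD) between the block in frame0 and the
    shifted block in frame1. Ties keep the first offset found in scan order.
    """
    h, w = len(frame0), len(frame0[0])
    r = block // 2
    if not (r <= y < h - r and r <= x < w - r):
        raise ValueError("the source block must lie fully inside frame0")
    best_cost, best_offset = float("inf"), (0, 0)
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            # Skip candidates whose block would leave the image.
            if not (r <= y + dy < h - r and r <= x + dx < w - r):
                continue
            cost = 0
            for i in range(-r, r + 1):
                for j in range(-r, r + 1):
                    cost += abs(frame0[y + i][x + j]
                                - frame1[y + dy + i][x + dx + j])
            if cost < best_cost:
                best_cost, best_offset = cost, (dy, dx)
    return best_offset


def make_frame(top, left, size=8):
    """Hypothetical helper: a black frame with a textured 3x3 patch."""
    patch = [[10, 20, 30], [40, 50, 60], [70, 80, 90]]
    frame = [[0] * size for _ in range(size)]
    for i in range(3):
        for j in range(3):
            frame[top + i][left + j] = patch[i][j]
    return frame


# The patch moves 1 pixel down and 2 pixels right between the two frames.
frame0 = make_frame(2, 2)
frame1 = make_frame(3, 4)
offset = block_matching_flow(frame0, frame1, 3, 3)
print(offset)
```

The offset is returned as (row, column) displacement; flow fields are often written as (u, v) with the horizontal component first, so swap the order if that convention is needed. Exhaustive search like this scales poorly with the search radius, which is one reason practical systems use coarse-to-fine or learned matching instead.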

Benchmarks

These leaderboards are used to track progress in Optical Flow Estimation

Dataset	Best Model
Sintel-clean	GMFlow
Sintel-final	FlowFormer
KITTI 2015 (train)	DEQ-Flow-H
KITTI 2015	CamLiRAFT
KITTI 2012	CroCo-Flow
Spring	CroCo-Flow
Sintel Clean unsupervised	MDFlow
Sintel Final unsupervised	UpFlow
KITTI 2015 unsupervised	MDFlow
KITTI 2012 unsupervised	ARFlow-MV
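
This page does not state which metric the leaderboards rank by. Optical flow benchmarks such as Sintel and KITTI conventionally report the average end-point error (EPE), the mean Euclidean distance between predicted and ground-truth flow vectors, and the sketch below assumes that metric. The function name `average_endpoint_error` and the list-of-tuples flow representation are choices made for this example, not part of any benchmark toolkit.

```python
import math


def average_endpoint_error(flow_pred, flow_gt):
    """Mean Euclidean distance between predicted and ground-truth flow.

    Each argument is a list of (u, v) displacement tuples, one per pixel,
    listed in the same pixel order.
    """
    if len(flow_pred) != len(flow_gt) or not flow_gt:
        raise ValueError("flows must be non-empty and of equal length")
    total = sum(
        math.hypot(up - ug, vp - vg)
        for (up, vp), (ug, vg) in zip(flow_pred, flow_gt)
    )
    return total / len(flow_gt)


# Two pixels: the first is predicted exactly, the second is off by (3, 4).
pred = [(1.0, 0.0), (3.0, 4.0)]
gt = [(1.0, 0.0), (0.0, 0.0)]
epe = average_endpoint_error(pred, gt)
print(epe)  # 2.5
```

Lower values are better. Real benchmark evaluations also mask out pixels without valid ground truth, which this sketch omits.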

Libraries

Use these libraries to find Optical Flow Estimation models and implementations

open-mmlab/mmflow • 9 papers • 895 stars

neu-vi/ezflow • 5 papers • 128 stars

Subtasks

Video Stabilization

Latest papers

Rethinking Low-quality Optical Flow in Unsupervised Surgical Instrument Segmentation

wpr1018001/rethinking-low-quality-optical-flow • 15 Mar 2024

Video-based surgical instrument segmentation plays an important role in robot-assisted surgeries.

LSTP: Language-guided Spatial-Temporal Prompt Learning for Long-form Video-Text Understanding

bigai-nlco/lstp-chat • 25 Feb 2024

Despite progress in video-language modeling, the computational challenge of interpreting long-form videos in response to task-specific linguistic queries persists, largely due to the complexity of high-dimensional video data and the misalignment between language and visual cues over space and time.

CREMA: Multimodal Compositional Video Reasoning via Efficient Modular Adaptation and Fusion

Yui010206/CREMA • 8 Feb 2024

Furthermore, we propose a fusion module designed to compress multimodal queries, maintaining computational efficiency in the LLM while combining additional modalities.

Taylor Videos for Action Recognition

leiwangr/video-ar • 5 Feb 2024

Addressing these challenges, we propose the Taylor video, a new video format that highlights the dominant motions (e.g., a waving hand) in each of its frames, named the Taylor frame.

Recurrent Partial Kernel Network for Efficient Optical Flow Estimation

hmorimitsu/ptlflow • The 38th Annual AAAI Conference on Artificial Intelligence (AAAI) 2024 • 01 Feb 2024 • 206 stars

However, this impacts the widespread adoption of optical flow methods and makes it harder to train more general models since the optical flow data is hard to obtain.

Multimodal Action Quality Assessment

qinghuannn/pamfn • 31 Jan 2024

To leverage multimodal information for AQA, i.e., RGB, optical flow, and audio information, we propose a Progressive Adaptive Multimodal Fusion Network (PAMFN) that separately models modality-specific information and mixed-modality information.

VONet: Unsupervised Video Object Learning With Parallel U-Net Attention and Object-wise Sequential VAE

hnyu/vonet • 20 Jan 2024

Unsupervised video object learning seeks to decompose video scenes into structural object representations without any supervision from depth, optical flow, or segmentation.

Deep Linear Array Pushbroom Image Restoration: A Degradation Pipeline and Jitter-Aware Restoration Network

jhw2000/jarnet • 16 Jan 2024

Both the proposed JARNet and LAP image synthesis pipeline establish a foundation for addressing this intricate challenge.

RomniStereo: Recurrent Omnidirectional Stereo Matching

halleyjiang/romnistereo • 9 Jan 2024

To bridge the gap between OSM and RAFT, we mainly propose an opposite adaptive weighting scheme to seamlessly transform the outputs of spherical sweeping of OSM into the required inputs for the recurrent update, thus creating a recurrent omnidirectional stereo matching (RomniStereo) algorithm.

Rethinking RAFT for Efficient Optical Flow

n3slami/Ef-RAFT • 1 Jan 2024

To address these problems, this paper proposes a novel approach based on the RAFT framework.
