Optical Flow Estimation
652 papers with code • 10 benchmarks • 33 datasets
Optical Flow Estimation is a computer vision task that involves computing the motion of objects in an image or a video sequence. The goal of optical flow estimation is to determine the movement of pixels or features in the image, which can be used for various applications such as object tracking, motion analysis, and video compression.
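The "movement of pixels" mentioned above is typically represented as a dense flow field: a per-pixel vector (u, v) giving each pixel's displacement between frames. A minimal numpy sketch of how such a field is consumed, e.g. for motion compensation in video compression, is backward warping (the function name and nearest-neighbour sampling are illustrative choices, not from any paper listed here):

```python
import numpy as np

def warp_with_flow(frame, flow):
    """Backward-warp a frame with a dense flow field.

    flow[y, x] = (u, v): the pixel at (x, y) in the target frame
    comes from (x - u, y - v) in the source (nearest-neighbour
    sampling, clipped at the image border).
    """
    h, w = frame.shape
    ys, xs = np.mgrid[0:h, 0:w]
    src_x = np.clip(np.round(xs - flow[..., 0]).astype(int), 0, w - 1)
    src_y = np.clip(np.round(ys - flow[..., 1]).astype(int), 0, h - 1)
    return frame[src_y, src_x]
```

With a uniform flow of one pixel to the right, the warped frame reproduces the source shifted by one column, which is exactly the prediction a video codec would subtract before encoding the residual.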
Approaches for optical flow estimation include correlation-based, block-matching, feature-tracking, energy-based, and gradient-based methods, and, more recently, deep learning-based models.
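The gradient-based family can be sketched with the classical Lucas-Kanade least-squares step: brightness constancy linearized around the first frame gives, per window, a 2x2 system built from image gradients. A minimal numpy version (estimating a single translation for the whole window; this is a textbook sketch, not code from any paper above):

```python
import numpy as np

def lucas_kanade_flow(frame1, frame2):
    """Gradient-based flow: solve the Lucas-Kanade normal equations
    for one (u, v) translation over the whole window.

    Brightness constancy linearized: Ix*u + Iy*v + It = 0, solved
    in the least-squares sense over all pixels.
    """
    f1 = frame1.astype(np.float64)
    f2 = frame2.astype(np.float64)
    Iy, Ix = np.gradient(f1)   # spatial gradients (central differences)
    It = f2 - f1               # temporal difference
    A = np.array([[np.sum(Ix * Ix), np.sum(Ix * Iy)],
                  [np.sum(Ix * Iy), np.sum(Iy * Iy)]])
    b = -np.array([np.sum(Ix * It), np.sum(Iy * It)])
    return np.linalg.solve(A, b)  # (u, v)
```

On a smooth periodic pattern shifted by one pixel, the recovered (u, v) is close to (1, 0); like all purely gradient-based estimators, it assumes small displacements, which is why practical pipelines add pyramids or, in recent work, learned correlation volumes.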
Further reading:
Definition source: Devon: Deformable Volume Network for Learning Optical Flow
Image credit: Optical Flow Estimation
Libraries
Use these libraries to find Optical Flow Estimation models and implementations.
Latest papers with no code
Deep-learning Optical Flow Outperforms PIV in Obtaining Velocity Fields from Active Nematics
Deep learning-based optical flow (DLOF) extracts features in adjacent video frames with deep convolutional neural networks.
Structure-Aware Human Body Reshaping with Adaptive Affinity-Graph Network
Particularly, an SRM filter is utilized to extract high-frequency details, which are combined with spatial features as input to the BSD.
Attack on Scene Flow using Point Clouds
Robustness of these techniques, however, remains a concern, particularly in the face of adversarial attacks that have been proven to deceive state-of-the-art deep neural networks in many domains.
Turb-Seg-Res: A Segment-then-Restore Pipeline for Dynamic Videos with Atmospheric Turbulence
Tackling image degradation due to atmospheric turbulence, particularly in dynamic environments, remains a challenge for long-range imaging systems.
3D Multi-frame Fusion for Video Stabilization
In this paper, we present RStab, a novel framework for video stabilization that integrates 3D multi-frame fusion through volume rendering.
Vision-based control for landing an aerial vehicle on a marine vessel
This work addresses the landing problem of an aerial vehicle, exemplified by a simple quadrotor, on a moving platform using image-based visual servo control.
TempBEV: Improving Learned BEV Encoders with Combined Image and BEV Space Temporal Aggregation
These results indicate the overall effectiveness of our approach and make a strong case for aggregating temporal information in both image and BEV latent spaces.
Improving Bracket Image Restoration and Enhancement with Flow-guided Alignment and Enhanced Feature Aggregation
In this paper, we address the Bracket Image Restoration and Enhancement (BracketIRE) task using a novel framework, which requires restoring a high-quality high dynamic range (HDR) image from a sequence of noisy, blurred, and low dynamic range (LDR) multi-exposure RAW inputs.
FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features
The task of face reenactment is to transfer the head motion and facial expressions from a driving video to the appearance of a source image, which may be of a different person (cross-reenactment).
Table tennis ball spin estimation with an event camera
In table tennis, the combination of high velocity and spin renders traditional low-frame-rate cameras inadequate for quickly and accurately observing the ball's logo to estimate its spin, owing to motion blur.