Video object segmentation is a binary labeling problem aiming to separate foreground object(s) from the background region of a video.
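Since the task is binary labeling per frame, VOS benchmarks such as DAVIS typically score predictions with the region similarity J, i.e. the intersection-over-union of the predicted and ground-truth foreground masks. A minimal NumPy sketch (the function name `jaccard` and the toy masks are illustrative, not from any specific benchmark toolkit):

```python
import numpy as np

def jaccard(pred: np.ndarray, gt: np.ndarray) -> float:
    """Region similarity J: intersection-over-union of two binary masks."""
    pred = pred.astype(bool)
    gt = gt.astype(bool)
    union = np.logical_or(pred, gt).sum()
    if union == 0:
        return 1.0  # both masks empty: treat as a perfect match
    return float(np.logical_and(pred, gt).sum() / union)

# Toy 4x4 frame: prediction overlaps ground truth on 2 pixels,
# union covers 4 pixels, so J = 2/4.
gt   = np.array([[0, 0, 0, 0], [0, 1, 1, 0], [0, 1, 0, 0], [0, 0, 0, 0]])
pred = np.array([[0, 0, 0, 0], [0, 1, 1, 1], [0, 0, 0, 0], [0, 0, 0, 0]])
print(jaccard(pred, gt))  # prints 0.5
```

Per-video scores are then usually averaged over all annotated frames.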
For leaderboards please refer to the different subtasks.
With these challenges in mind, we propose a novel training method and evaluation metrics for the seed rejection problem.
Our approach tolerates a modest amount of noise in the box placements, so typically only a few clicks are needed to annotate tracked boxes to sufficient accuracy.
As our training sets grow, ever more objects are observed in motion, turning our method into unsupervised (or time-supervised) training for segmenting primary objects.
Video object segmentation can be understood as a sequence-to-sequence task that can benefit from curriculum learning strategies for better and faster training of deep neural networks.
In this paper, we introduce a novel network, called discriminative feature network (DFNet), to address the unsupervised video object segmentation task.
Self-supervised learning for visual object tracking offers valuable advantages over supervised learning, such as removing the need for laborious human annotations and permitting online training.
Unlike previous works, we apply the Hide-and-Seek strategy during pre-training to obtain the best possible results in handling occlusions and extracting segment boundaries.
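The Hide-and-Seek strategy (Singh & Lee, 2017) randomly hides grid patches of the input during training, forcing the network to rely on less discriminative regions, which is what makes it useful for occlusion robustness. A minimal NumPy sketch under that reading (function name, grid size, and fill value are illustrative choices, not the paper's exact configuration):

```python
import numpy as np

def hide_and_seek(image: np.ndarray, grid: int = 4, p_hide: float = 0.5,
                  fill: float = 0.0, seed=None) -> np.ndarray:
    """Hide-and-Seek augmentation: divide the image into a grid x grid
    layout of patches and replace each patch with `fill` independently
    with probability p_hide."""
    rng = np.random.default_rng(seed)
    out = image.copy()  # do not modify the caller's image in place
    h, w = image.shape[:2]
    ph, pw = h // grid, w // grid
    for i in range(grid):
        for j in range(grid):
            if rng.random() < p_hide:
                out[i * ph:(i + 1) * ph, j * pw:(j + 1) * pw] = fill
    return out

# Usage: hide roughly half of the 2x2-pixel patches of an 8x8 frame.
frame = np.ones((8, 8), dtype=np.float32)
masked = hide_and_seek(frame, grid=4, p_hide=0.5, seed=0)
```

In practice the fill value is often the dataset mean rather than zero, so hidden patches match the input statistics the network expects.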
Significant progress has been made in Video Object Segmentation (VOS), the video object tracking task at its finest level of granularity.
Although our baseline system is a straightforward combination of standard methods, we obtain state-of-the-art results.