Video Object Segmentation
243 papers with code • 9 benchmarks • 17 datasets
Video object segmentation is a binary labeling problem that aims to separate the foreground object(s) from the background region of a video.
For leaderboards please refer to the different subtasks.
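As a concrete illustration of the binary-labeling formulation, the sketch below propagates a first-frame mask through a clip with a deliberately naive baseline: a pixel is labeled foreground if its color is close to the mean foreground color of the first frame. This is a hypothetical toy example (function name, threshold, and color-matching rule are all assumptions for illustration), not any of the methods listed on this page.

```python
import numpy as np

def propagate_mask(frames, first_mask, threshold=30.0):
    """Naive VOS baseline (illustrative only): label a pixel as foreground
    when its color lies within `threshold` of the mean foreground color
    taken from the annotated first frame.

    frames:     array of shape (T, H, W, 3), float RGB values
    first_mask: boolean array of shape (H, W), ground-truth mask of frame 0
    returns:    list of T boolean (H, W) masks, one per frame
    """
    # Mean RGB color of the object pixels in the first frame.
    fg_color = frames[0][first_mask].mean(axis=0)
    masks = [first_mask]
    for frame in frames[1:]:
        # Per-pixel Euclidean distance to the foreground color.
        dist = np.linalg.norm(frame - fg_color, axis=-1)
        masks.append(dist < threshold)
    return masks
```

Real VOS methods replace this color heuristic with learned appearance and motion features, but the input/output contract (frames plus a reference mask in, one binary mask per frame out) is the same.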
Latest papers
Flexible visual prompts for in-context learning in computer vision
Additionally, we propose a technique for support set selection, which involves choosing the most relevant images to include in this set.
Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation
In this paper, we propose a simple yet effective approach for self-supervised video object segmentation (VOS).
SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation
In-context segmentation aims to segment novel images using a few labeled example images, termed "in-context examples", by exploiting content similarities between the examples and the target.
Putting the Object Back into Video Object Segmentation
We present Cutie, a video object segmentation (VOS) network with object-level memory reading, which puts the object representation from memory back into the video object segmentation result.
Treating Motion as Option with Output Selection for Unsupervised Video Object Segmentation
Unsupervised video object segmentation (VOS) is a task that aims to detect the most salient object in a video without external guidance about the object.
PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation
Our dataset poses new challenges in panoramic VOS and we hope that our PanoVOS can advance the development of panoramic segmentation/tracking.
Multi-grained Temporal Prototype Learning for Few-shot Video Object Segmentation
We decompose the query video information into a clip prototype and a memory prototype for capturing local and long-term internal temporal guidance, respectively.
Tracking Anything with Decoupled Video Segmentation
To 'track anything' without training on video data for every individual task, we develop a decoupled video segmentation approach (DEVA), composed of task-specific image-level segmentation and class/task-agnostic bi-directional temporal propagation.
Learning Cross-Modal Affinity for Referring Video Object Segmentation Targeting Limited Samples
Referring video object segmentation (RVOS), as a supervised learning task, relies on sufficient annotated data for a given scene.
Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation
Tracking any given object(s) spatially and temporally is a common purpose in Visual Object Tracking (VOT) and Video Object Segmentation (VOS).