Video Object Segmentation

240 papers with code • 9 benchmarks • 17 datasets

Video object segmentation is a binary labeling problem aiming to separate foreground object(s) from the background region of a video.

For leaderboards please refer to the different subtasks.

Benchmarks

Add a Result

These leaderboards are used to track progress in Video Object Segmentation

Dataset	Best Model	Compare
DAVIS 2016	ISVOS (BL30K, MS)	See all
DAVIS 2017 (val)	XMem (BLK30K, MS)	See all
YouTube-VOS 2018	XMem (BL30K, MS)	See all
DAVIS 2017 (test-dev)	BATMAN	See all
YouTube-VOS 2019	XMem (BL30K,MS)	See all
DAVIS 2017	AOC-MF (val)	See all
FBMS	Ours	See all
DAVIS-2017 (test-dev)	XMem (BL30K, MS)	See all
YouTube	Ours	See all

Libraries

Use these libraries to find Video Object Segmentation models and implementations

yoxu515/aot-benchmark

4 papers

560

visionml/pytracking

3 papers

3,080

hkchengrex/Mask-Propagation

3 papers

124

z-x-yang/AOT

3 papers

116

Datasets

Subtasks

Video Salient Object Detection

Interactive Video Object Segmentation

Long-tail Video Object Segmentation

Latest papers

Most implemented Social Latest No code

1st Place Solution for 5th LSVOS Challenge: Referring Video Object Segmentation

robertluo1/iccv2023_rvos_challenge • • 1 Jan 2024

The recent transformer-based models have dominated the Referring Video Object Segmentation (RVOS) task due to the superior performance.

01 Jan 2024

Paper
Code

Tracking with Human-Intent Reasoning

jiawen-zhu/trackgpt • • 29 Dec 2023

The perception component then generates the tracking results based on the embeddings.

29 Dec 2023

Paper
Code

UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces

foundationvision/uniref • • 25 Dec 2023

We evaluate our unified models on various benchmarks.

218

25 Dec 2023

Paper
Code

Hierarchical Graph Pattern Understanding for Zero-Shot VOS

nust-machine-intelligence-laboratory/hgpu • • 15 Dec 2023

However, existing optical flow-based methods have a significant dependency on optical flow, which results in poor performance when the optical flow estimation fails for a particular scene.

15 Dec 2023

Paper
Code

General Object Foundation Model for Images and Videos at Scale

FoundationVision/GLEE • • 14 Dec 2023

We present GLEE in this work, an object-level foundation model for locating and identifying objects in images and videos.

870

14 Dec 2023

Paper
Code

Semi-supervised Active Learning for Video Action Detection

akash2907/semi-sup-active-learning • • 12 Dec 2023

First, we demonstrate its effectiveness on video action detection where the proposed approach outperforms prior works in semi-supervised and weakly-supervised learning along with several baseline approaches in both UCF101-24 and JHMDB-21.

12 Dec 2023

Paper
Code

Flexible visual prompts for in-context learning in computer vision

v7labs/xmem_icl • 11 Dec 2023

Additionally, we propose a technique for support set selection, which involves choosing the most relevant images to include in this set.

11 Dec 2023

Paper
Code

Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation

shvdiwnkozbw/ssl-uvos • • 29 Nov 2023

In this paper, we propose a simple yet effective approach for self-supervised video object segmentation (VOS).

29 Nov 2023

Paper
Code

SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation

menglcool/segic • 24 Nov 2023

In-context segmentation aims at segmenting novel images using a few labeled example images, termed as "in-context examples", exploring content similarities between examples and the target.

24 Nov 2023

Paper
Code

Putting the Object Back into Video Object Segmentation

hkchengrex/Cutie • • 19 Oct 2023

We present Cutie, a video object segmentation (VOS) network with object-level memory reading, which puts the object representation from memory back into the video object segmentation result.

455

19 Oct 2023

Paper
Code

Video Object Segmentation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result