Video Object Segmentation

240 papers with code • 9 benchmarks • 17 datasets

Video object segmentation is a binary labeling problem aiming to separate foreground object(s) from the background region of a video.

For leaderboards please refer to the different subtasks.

Benchmarks

Add a Result

These leaderboards are used to track progress in Video Object Segmentation

Dataset	Best Model	Compare
DAVIS 2016	ISVOS (BL30K, MS)	See all
DAVIS 2017 (val)	XMem (BLK30K, MS)	See all
YouTube-VOS 2018	XMem (BL30K, MS)	See all
DAVIS 2017 (test-dev)	BATMAN	See all
YouTube-VOS 2019	XMem (BL30K,MS)	See all
DAVIS 2017	AOC-MF (val)	See all
FBMS	Ours	See all
DAVIS-2017 (test-dev)	XMem (BL30K, MS)	See all
YouTube	Ours	See all

Libraries

Use these libraries to find Video Object Segmentation models and implementations

yoxu515/aot-benchmark

4 papers

560

visionml/pytracking

3 papers

3,080

hkchengrex/Mask-Propagation

3 papers

124

z-x-yang/AOT

3 papers

116

Datasets

Subtasks

Video Salient Object Detection

Interactive Video Object Segmentation

Long-tail Video Object Segmentation

Latest papers

Most implemented Social Latest No code

Towards Temporally Consistent Referring Video Object Segmentation

bo-miao/HTR • • 28 Mar 2024

Referring Video Object Segmentation (R-VOS) methods face challenges in maintaining consistent object segmentation due to temporal context variability and the presence of other visually similar objects.

28 Mar 2024

Paper
Code

Efficient Video Object Segmentation via Modulated Cross-Attention Memory

amshaker/mavos • 26 Mar 2024

Recently, transformer-based approaches have shown promising results for semi-supervised video object segmentation.

26 Mar 2024

Paper
Code

PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model

zamling/psalm • • 21 Mar 2024

PSALM is a powerful extension of the Large Multi-modal Model (LMM) to address the segmentation task challenges.

111

21 Mar 2024

Paper
Code

Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation

buxiangzhiren/vd-it • • 18 Mar 2024

We hypothesize that the latent representation learned from a pretrained generative T2V model encapsulates rich semantics and coherent temporal correspondences, thereby naturally facilitating video understanding.

18 Mar 2024

Paper
Code

Video Object Segmentation with Dynamic Query Modulation

zht8506/qmvos • • 18 Mar 2024

Storing intermediate frame segmentations as memory for long-range context modeling, spatial-temporal memory-based methods have recently showcased impressive results in semi-supervised video object segmentation (SVOS).

18 Mar 2024

Paper
Code

VideoMAC: Video Masked Autoencoders Meet ConvNets

nust-machine-intelligence-laboratory/videomac • • 29 Feb 2024

In this paper, we propose a new approach termed as \textbf{VideoMAC}, which combines video masked autoencoders with resource-friendly ConvNets.

29 Feb 2024

Paper
Code

UniVS: Unified and Universal Video Segmentation with Prompts as Queries

minghanli/univs • • 28 Feb 2024

Despite the recent advances in unified image segmentation (IS), developing a unified video segmentation (VS) model remains a challenge.

113

28 Feb 2024

Paper
Code

Lester: rotoscope animation through video object segmentation and tracking

rtous/lester • 15 Feb 2024

This article introduces Lester, a novel method to automatically synthetise retro-style 2D animations from videos.

15 Feb 2024

Paper
Code

Vivim: a Video Vision Mamba for Medical Video Object Segmentation

scott-yjyang/vivim • • 25 Jan 2024

Traditional convolutional neural networks have a limited receptive field while transformer-based networks are mediocre in constructing long-term dependency from the perspective of computational complexity.

25 Jan 2024

Paper
Code

OMG-Seg: Is One Model Good Enough For All Segmentation?

lxtgh/omg-seg • • 18 Jan 2024

In this work, we address various segmentation tasks, each traditionally tackled by distinct or partially unified models.

681

18 Jan 2024

Paper
Code

Video Object Segmentation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result