Video Object Segmentation

243 papers with code • 9 benchmarks • 17 datasets

Video object segmentation is a binary labeling problem aiming to separate foreground object(s) from the background region of a video.
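As a rough illustration of the binary-labeling formulation above, the sketch below represents each frame's segmentation as a boolean mask (True for foreground, False for background) and compares a predicted mask sequence against a reference one. The intersection-over-union score and the helper names `jaccard` and `mean_jaccard` are assumptions chosen for illustration; this page does not say which metric any particular leaderboard uses.

```python
import numpy as np

def jaccard(pred, gt):
    """Intersection-over-union between two boolean masks of equal shape."""
    pred = np.asarray(pred, dtype=bool)
    gt = np.asarray(gt, dtype=bool)
    union = np.logical_or(pred, gt).sum()
    if union == 0:
        return 1.0  # both masks empty: treat as perfect agreement (a convention, assumed here)
    return float(np.logical_and(pred, gt).sum() / union)

def mean_jaccard(pred_video, gt_video):
    """Average the per-frame Jaccard index over a (T, H, W) mask sequence."""
    return float(np.mean([jaccard(p, g) for p, g in zip(pred_video, gt_video)]))

# Toy clip: 2 frames of 4x4 pixels; True marks foreground, False background.
gt = np.zeros((2, 4, 4), dtype=bool)
gt[:, 1:3, 1:3] = True   # a 2x2 object in both frames
pred = gt.copy()
pred[1, 1, 1] = False    # the prediction misses one foreground pixel in the second frame

score = mean_jaccard(pred, gt)
```

In this toy clip the first frame matches exactly and the second frame recovers three of the four foreground pixels, so the averaged score falls between the two per-frame values.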

For leaderboards, please refer to the different subtasks.

Benchmarks

These leaderboards are used to track progress in Video Object Segmentation.

Dataset	Best Model
DAVIS 2016	ISVOS (BL30K, MS)
DAVIS 2017 (val)	XMem (BL30K, MS)
YouTube-VOS 2018	XMem (BL30K, MS)
DAVIS 2017 (test-dev)	BATMAN
YouTube-VOS 2019	XMem (BL30K, MS)
DAVIS 2017	AOC-MF (val)
FBMS	Ours
DAVIS-2017 (test-dev)	XMem (BL30K, MS)
YouTube	Ours

Libraries

Use these libraries to find Video Object Segmentation models and implementations.

Repository	Papers	Stars
yoxu515/aot-benchmark	4	563
visionml/pytracking	3	3,089
hkchengrex/Mask-Propagation	3	124
z-x-yang/AOT	3	118

Subtasks

Video Salient Object Detection

Interactive Video Object Segmentation

Long-tail Video Object Segmentation

Latest papers with no code

Zero-Shot Open-Vocabulary Tracking with Large Pre-Trained Models

no code yet • 10 Oct 2023

This begs the question: can we re-purpose these large-scale pre-trained static image models for open-vocabulary video tracking?

Sub-token ViT Embedding via Stochastic Resonance Transformers

no code yet • 6 Oct 2023

We term our method "Stochastic Resonance Transformer" (SRT), which we show can effectively super-resolve features of pre-trained ViTs, capturing more of the local fine-grained structures that might otherwise be neglected as a result of tokenization.

CoralVOS: Dataset and Benchmark for Coral Video Segmentation

no code yet • 3 Oct 2023

We perform experiments on our CoralVOS dataset, including 6 recent state-of-the-art video object segmentation (VOS) algorithms.

Memory-Efficient Continual Learning Object Segmentation for Long Video

no code yet • 26 Sep 2023

We propose two novel techniques to reduce the memory requirement of Online VOS methods while improving modeling accuracy and generalization on long videos.

Adversarial Attacks on Video Object Segmentation with Hard Region Discovery

no code yet • 25 Sep 2023

Particularly, the gradients from the segmentation model are exploited to discover the easily confused region, in which it is difficult to identify the pixel-wise objects from the background in a frame.

Fully Transformer-Equipped Architecture for End-to-End Referring Video Object Segmentation

no code yet • 21 Sep 2023

Referring Video Object Segmentation (RVOS) requires segmenting the object in video referred by a natural language query.

Efficient Long-Short Temporal Attention Network for Unsupervised Video Object Segmentation

no code yet • 21 Sep 2023

Unsupervised Video Object Segmentation (VOS) aims at identifying the contours of primary foreground objects in videos without any prior knowledge.

Temporal Collection and Distribution for Referring Video Object Segmentation

no code yet • ICCV 2023

Furthermore, to explicitly capture object motions and spatial-temporal cross-modal reasoning over objects, we propose a novel temporal collection-distribution mechanism for interacting between the global referent token and object queries.

Robust Visual Tracking by Motion Analyzing

no code yet • 6 Sep 2023

In this paper, we propose a new algorithm that addresses this limitation by analyzing the motion pattern using the inherent tensor structure.

Joint Modeling of Feature, Correspondence, and a Compressed Memory for Video Object Segmentation

no code yet • 25 Aug 2023

To overcome these issues, we propose a unified VOS framework, coined as JointFormer, for joint modeling the three elements of feature, correspondence, and a compressed memory.
