Video Object Segmentation

243 papers with code • 9 benchmarks • 17 datasets

Video object segmentation is a binary labeling problem aiming to separate foreground object(s) from the background region of a video.

For leaderboards please refer to the different subtasks.

Libraries

Use these libraries to find Video Object Segmentation models and implementations

Flexible visual prompts for in-context learning in computer vision

v7labs/xmem_icl 11 Dec 2023

Additionally, we propose a technique for support set selection, which involves choosing the most relevant images to include in this set.

6
11 Dec 2023

Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation

shvdiwnkozbw/ssl-uvos 29 Nov 2023

In this paper, we propose a simple yet effective approach for self-supervised video object segmentation (VOS).

20
29 Nov 2023

SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation

menglcool/segic 24 Nov 2023

In-context segmentation aims at segmenting novel images using a few labeled example images, termed as "in-context examples", exploring content similarities between examples and the target.

13
24 Nov 2023

Putting the Object Back into Video Object Segmentation

hkchengrex/Cutie 19 Oct 2023

We present Cutie, a video object segmentation (VOS) network with object-level memory reading, which puts the object representation from memory back into the video object segmentation result.

474
19 Oct 2023

Treating Motion as Option with Output Selection for Unsupervised Video Object Segmentation

suhwan-cho/tmo 26 Sep 2023

Unsupervised video object segmentation (VOS) is a task that aims to detect the most salient object in a video without external guidance about the object.

45
26 Sep 2023

PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation

shilinyan99/panovos 21 Sep 2023

Our dataset poses new challenges in panoramic VOS and we hope that our PanoVOS can advance the development of panoramic segmentation/tracking.

12
21 Sep 2023

Multi-grained Temporal Prototype Learning for Few-shot Video Object Segmentation

nankepan/VIPMT ICCV 2023

We decompose the query video information into a clip prototype and a memory prototype for capturing local and long-term internal temporal guidance, respectively.

5
20 Sep 2023

Tracking Anything with Decoupled Video Segmentation

hkchengrex/Tracking-Anything-with-DEVA ICCV 2023

To 'track anything' without training on video data for every individual task, we develop a decoupled video segmentation approach (DEVA), composed of task-specific image-level segmentation and class/task-agnostic bi-directional temporal propagation.

1,068
07 Sep 2023

Learning Cross-Modal Affinity for Referring Video Object Segmentation Targeting Limited Samples

hengliusky/few_shot_rvos ICCV 2023

Referring video object segmentation (RVOS), as a supervised learning task, relies on sufficient annotated data for a given scene.

5
05 Sep 2023

Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation

yoxu515/mits ICCV 2023

Tracking any given object(s) spatially and temporally is a common purpose in Visual Object Tracking (VOT) and Video Object Segmentation (VOS).

14
25 Aug 2023