Video Object Segmentation

240 papers with code • 9 benchmarks • 17 datasets

Video object segmentation is a binary labeling problem aiming to separate foreground object(s) from the background region of a video.

For leaderboards please refer to the different subtasks.

Libraries

Use these libraries to find Video Object Segmentation models and implementations

1st Place Solution for 5th LSVOS Challenge: Referring Video Object Segmentation

robertluo1/iccv2023_rvos_challenge 1 Jan 2024

The recent transformer-based models have dominated the Referring Video Object Segmentation (RVOS) task due to the superior performance.

10
01 Jan 2024

Tracking with Human-Intent Reasoning

jiawen-zhu/trackgpt 29 Dec 2023

The perception component then generates the tracking results based on the embeddings.

52
29 Dec 2023

UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces

foundationvision/uniref 25 Dec 2023

We evaluate our unified models on various benchmarks.

218
25 Dec 2023

Hierarchical Graph Pattern Understanding for Zero-Shot VOS

nust-machine-intelligence-laboratory/hgpu 15 Dec 2023

However, existing optical flow-based methods have a significant dependency on optical flow, which results in poor performance when the optical flow estimation fails for a particular scene.

2
15 Dec 2023

General Object Foundation Model for Images and Videos at Scale

FoundationVision/GLEE 14 Dec 2023

We present GLEE in this work, an object-level foundation model for locating and identifying objects in images and videos.

870
14 Dec 2023

Semi-supervised Active Learning for Video Action Detection

akash2907/semi-sup-active-learning 12 Dec 2023

First, we demonstrate its effectiveness on video action detection where the proposed approach outperforms prior works in semi-supervised and weakly-supervised learning along with several baseline approaches in both UCF101-24 and JHMDB-21.

0
12 Dec 2023

Flexible visual prompts for in-context learning in computer vision

v7labs/xmem_icl 11 Dec 2023

Additionally, we propose a technique for support set selection, which involves choosing the most relevant images to include in this set.

5
11 Dec 2023

Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation

shvdiwnkozbw/ssl-uvos 29 Nov 2023

In this paper, we propose a simple yet effective approach for self-supervised video object segmentation (VOS).

20
29 Nov 2023

SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation

menglcool/segic 24 Nov 2023

In-context segmentation aims at segmenting novel images using a few labeled example images, termed as "in-context examples", exploring content similarities between examples and the target.

12
24 Nov 2023

Putting the Object Back into Video Object Segmentation

hkchengrex/Cutie 19 Oct 2023

We present Cutie, a video object segmentation (VOS) network with object-level memory reading, which puts the object representation from memory back into the video object segmentation result.

455
19 Oct 2023