Video Object Segmentation

243 papers with code • 9 benchmarks • 17 datasets

Video object segmentation is a binary labeling problem aiming to separate foreground object(s) from the background region of a video.
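As a rough illustration of what "binary labeling" means here, the sketch below assigns every pixel of every frame a foreground/background label. It is a toy under stated assumptions: the fixed intensity threshold is only a stand-in for a real segmentation model, and the names `segment_frame`, `segment_video`, `split_frame`, and the tiny synthetic video are hypothetical, not taken from any of the listed papers.

```python
# Toy illustration only: a fixed intensity threshold stands in for a real model.
Frame = list[list[int]]   # H x W grayscale frame, values 0-255
Mask = list[list[bool]]   # H x W binary labels: True = foreground


def segment_frame(frame: Frame, threshold: int = 128) -> Mask:
    """Label each pixel as foreground (True) or background (False)."""
    return [[pixel >= threshold for pixel in row] for row in frame]


def segment_video(video: list[Frame], threshold: int = 128) -> list[Mask]:
    """Produce one binary mask per frame (T masks of size H x W)."""
    return [segment_frame(frame, threshold) for frame in video]


def split_frame(frame: Frame, mask: Mask) -> tuple[Frame, Frame]:
    """Separate a frame into foreground and background images (zeros elsewhere)."""
    fg = [[p if m else 0 for p, m in zip(row, mrow)] for row, mrow in zip(frame, mask)]
    bg = [[0 if m else p for p, m in zip(row, mrow)] for row, mrow in zip(frame, mask)]
    return fg, bg


# Hypothetical 2-frame, 3x3 video with a bright object moving one pixel right.
video = [
    [[10, 200, 10], [10, 200, 10], [10, 10, 10]],
    [[10, 10, 200], [10, 10, 200], [10, 10, 10]],
]
masks = segment_video(video)
foreground, background = split_frame(video[0], masks[0])
```

The output has the same temporal and spatial layout as the input, with one boolean per pixel per frame; a real method would replace only the thresholding step in `segment_frame`.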

For leaderboards, please refer to the individual subtasks.

UniVS: Unified and Universal Video Segmentation with Prompts as Queries

minghanli/univs 28 Feb 2024

Despite the recent advances in unified image segmentation (IS), developing a unified video segmentation (VS) model remains a challenge.

123

Lester: rotoscope animation through video object segmentation and tracking

rtous/lester 15 Feb 2024

This article introduces Lester, a novel method to automatically synthesize retro-style 2D animations from videos.

3

Vivim: a Video Vision Mamba for Medical Video Object Segmentation

scott-yjyang/vivim 25 Jan 2024

Traditional convolutional neural networks have a limited receptive field while transformer-based networks are mediocre in constructing long-term dependency from the perspective of computational complexity.

104

OMG-Seg: Is One Model Good Enough For All Segmentation?

lxtgh/omg-seg 18 Jan 2024

In this work, we address various segmentation tasks, each traditionally tackled by distinct or partially unified models.

681

1st Place Solution for 5th LSVOS Challenge: Referring Video Object Segmentation

robertluo1/iccv2023_rvos_challenge 1 Jan 2024

Recent transformer-based models have dominated the Referring Video Object Segmentation (RVOS) task due to their superior performance.

10

Tracking with Human-Intent Reasoning

jiawen-zhu/trackgpt 29 Dec 2023

The perception component then generates the tracking results based on the embeddings.

59

UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces

foundationvision/uniref 25 Dec 2023

We evaluate our unified models on various benchmarks.

222

Hierarchical Graph Pattern Understanding for Zero-Shot VOS

nust-machine-intelligence-laboratory/hgpu 15 Dec 2023

However, existing optical flow-based methods have a significant dependency on optical flow, which results in poor performance when the optical flow estimation fails for a particular scene.

2

General Object Foundation Model for Images and Videos at Scale

FoundationVision/GLEE 14 Dec 2023

We present GLEE in this work, an object-level foundation model for locating and identifying objects in images and videos.

901

Semi-supervised Active Learning for Video Action Detection

akash2907/semi-sup-active-learning 12 Dec 2023

First, we demonstrate its effectiveness on video action detection where the proposed approach outperforms prior works in semi-supervised and weakly-supervised learning along with several baseline approaches in both UCF101-24 and JHMDB-21.

0