Semi-Supervised Video Object Segmentation

94 papers with code • 15 benchmarks • 13 datasets

The semi-supervised scenario assumes the user inputs a full mask of the object(s) of interest in the first frame of a video sequence. Methods have to produce the segmentation mask for that object(s) in the subsequent frames.

Libraries

Use these libraries to find Semi-Supervised Video Object Segmentation models and implementations

Efficient Video Object Segmentation via Modulated Cross-Attention Memory

amshaker/mavos 26 Mar 2024

Recently, transformer-based approaches have shown promising results for semi-supervised video object segmentation.

7
26 Mar 2024

Video Object Segmentation with Dynamic Query Modulation

zht8506/qmvos 18 Mar 2024

Storing intermediate frame segmentations as memory for long-range context modeling, spatial-temporal memory-based methods have recently showcased impressive results in semi-supervised video object segmentation (SVOS).

4
18 Mar 2024

Lester: rotoscope animation through video object segmentation and tracking

rtous/lester 15 Feb 2024

This article introduces Lester, a novel method to automatically synthetise retro-style 2D animations from videos.

3
15 Feb 2024

ODTrack: Online Dense Temporal Token Learning for Visual Tracking

gxnu-zhonglab/odtrack 3 Jan 2024

To alleviate the above problem, we propose a simple, flexible and effective video-level tracking pipeline, named \textbf{ODTrack}, which densely associates the contextual relationships of video frames in an online token propagation manner.

64
03 Jan 2024

Putting the Object Back into Video Object Segmentation

hkchengrex/Cutie 19 Oct 2023

The object queries act as a high-level summary of the target object, while high-resolution feature maps are retained for accurate segmentation.

413
19 Oct 2023

Tracking Anything with Decoupled Video Segmentation

hkchengrex/Tracking-Anything-with-DEVA ICCV 2023

To 'track anything' without training on video data for every individual task, we develop a decoupled video segmentation approach (DEVA), composed of task-specific image-level segmentation and class/task-agnostic bi-directional temporal propagation.

998
07 Sep 2023

XMem++: Production-level Video Segmentation From Few Annotated Frames

max810/XMem2 ICCV 2023

Despite advancements in user-guided video segmentation, extracting complex objects consistently for highly complex scenes is still a labor-intensive task, especially for production.

125
29 Jul 2023

Tracking Anything in High Quality

jiawen-zhu/hqtrack 26 Jul 2023

To further improve the quality of tracking masks, a pretrained MR model is employed to refine the tracking results.

730
26 Jul 2023

READMem: Robust Embedding Association for a Diverse Memory in Unconstrained Video Object Segmentation

Vujas-Eteph/READMem 22 May 2023

We present READMem (Robust Embedding Association for a Diverse Memory), a modular framework for semi-automatic video object segmentation (sVOS) methods designed to handle unconstrained videos.

5
22 May 2023

Video Object Segmentation in Panoptic Wild Scenes

yoxu515/aot-benchmark 8 May 2023

Considering the challenges in panoptic VOS, we propose a strong baseline method named panoptic object association with transformers (PAOT), which uses panoptic identification to associate objects with a pyramid architecture on multiple scales.

554
08 May 2023