Video Object Segmentation

240 papers with code • 9 benchmarks • 17 datasets

Video object segmentation is a binary labeling problem aiming to separate foreground object(s) from the background region of a video.

For leaderboards please refer to the different subtasks.

Libraries

Use these libraries to find Video Object Segmentation models and implementations

Latest papers with no code

Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention

no code yet • 25 Jan 2024

This is enabled by deformable attention mechanism, where the keys and values capturing the memory of a video sequence in the attention module have flexible locations updated across frames.

Explore Synergistic Interaction Across Frames for Interactive Video Object Segmentation

no code yet • 23 Jan 2024

Interactive Video Object Segmentation (iVOS) is a challenging task that requires real-time human-computer interaction.

Understanding Video Transformers via Universal Concept Discovery

no code yet • 19 Jan 2024

Concretely, we seek to explain the decision-making process of video transformers based on high-level, spatiotemporal concepts that are automatically discovered.

No More Shortcuts: Realizing the Potential of Temporal Self-Supervision

no code yet • 20 Dec 2023

To address these issues, we propose 1) a more challenging reformulation of temporal self-supervision as frame-level (rather than clip-level) recognition tasks and 2) an effective augmentation strategy to mitigate shortcuts.

TAM-VT: Transformation-Aware Multi-scale Video Transformer for Segmentation and Tracking

no code yet • 13 Dec 2023

In this work we propose a novel, clip-based DETR-style encoder-decoder architecture, which focuses on systematically analyzing and addressing aforementioned challenges.

VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models

no code yet • 30 Nov 2023

Our model can edit and translate the desired results within seconds based on user instructions.

SimulFlow: Simultaneously Extracting Feature and Identifying Target for Unsupervised Video Object Segmentation

no code yet • 30 Nov 2023

We evaluate our method on several benchmark datasets and achieve state-of-the-art results.

Sketch-based Video Object Segmentation: Benchmark and Analysis

no code yet • 13 Nov 2023

Reference-based video object segmentation is an emerging topic which aims to segment the corresponding target object in each video frame referred by a given reference, such as a language expression or a photo mask.

Learning the What and How of Annotation in Video Object Segmentation

no code yet • 8 Nov 2023

To reduce this annotation cost, in this paper, we propose EVA-VOS, a human-in-the-loop annotation framework for video object segmentation.

ISAR: A Benchmark for Single- and Few-Shot Object Instance Segmentation and Re-Identification

no code yet • 5 Nov 2023

To build spatial AI systems that can quickly be taught about new objects, we need to effectively solve the problem of single-shot object detection, instance segmentation and re-identification.