Semi-Supervised Video Object Segmentation

95 papers with code • 15 benchmarks • 13 datasets

The semi-supervised scenario assumes the user inputs a full mask of the object(s) of interest in the first frame of a video sequence. Methods have to produce the segmentation mask for that object(s) in the subsequent frames.

Benchmarks

Add a Result

These leaderboards are used to track progress in Semi-Supervised Video Object Segmentation

Dataset	Best Model	Compare
DAVIS 2017 (val)	Cutie+ (base)	See all
DAVIS 2016	ISVOS (BL30K, MS)	See all
DAVIS 2017 (test-dev)	Cutie+ (base, MEGA)	See all
YouTube-VOS 2018	Cutie+ (base, MEGA)	See all
DAVIS (no YouTube-VOS training)	HMMN	See all
YouTube-VOS 2019	Cutie+ (base, MEGA)	See all
VOT2020	SwinB-DeAOT-L	See all
MOSE	Cutie+ (base, MEGA)	See all
Long Video Dataset	ISVOS	See all
YouTube	FEELVOS	See all
DAVIS-2017	STCN + TrickVOS (PT)	See all
Long Video Dataset (3X)	XMem	See all
BURST-val	Cutie (base, MEGA, 600 pixels)	See all
BURST-test	Cutie (base, MEGA, 600 pixels)	See all
DAVIS-2016	STCN + TrickVOS (PT)	See all

Show all 15 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Semi-Supervised Video Object Segmentation models and implementations

yoxu515/aot-benchmark

4 papers

564

hkchengrex/Mask-Propagation

3 papers

124

z-x-yang/AOT

3 papers

119

Datasets

Subtasks

One-shot visual object segmentation

Latest papers

Most implemented Social Latest No code

Towards Robust Video Object Segmentation with Adaptive Object Calibration

jerryx1110/robust-video-object-segmentation • • 2 Jul 2022

We consolidate this conditional mask calibration process in a progressive manner, where the object representations and proto-masks evolve to be discriminative iteratively.

02 Jul 2022

Paper
Code

Recurrent Dynamic Embedding for Video Object Segmentation

limingxing00/rde-vos-cvpr2022 • • CVPR 2022

In this paper, we propose a Recurrent Dynamic Embedding (RDE) to build a memory bank of constant size.

08 May 2022

Paper
Code

Boosting Video Object Segmentation based on Scale Inconsistency

HengyiWang/SIRNet • • 2 May 2022

We present a refinement framework to boost the performance of pre-trained semi-supervised video object segmentation (VOS) models.

02 May 2022

Paper
Code

Adaptive Memory Management for Video Object Segmentation

alipga/AMM_VOS • • 13 Apr 2022

Matching-based networks have achieved state-of-the-art performance for video object segmentation (VOS) tasks by storing every-k frames in an external memory bank for future inference.

13 Apr 2022

Paper
Code

Scalable Video Object Segmentation with Identification Mechanism

yoxu515/aot-benchmark • • 22 Mar 2022

This paper delves into the challenges of achieving scalable and effective multi-object modeling for semi-supervised Video Object Segmentation (VOS).

564

22 Mar 2022

Paper
Code

MixFormer: End-to-End Tracking with Iterative Mixed Attention

MCG-NJU/MixFormer • • CVPR 2022

Our core design is to utilize the flexibility of attention operations, and propose a Mixed Attention Module (MAM) for simultaneous feature extraction and target information integration.

424

21 Mar 2022

Paper
Code

Siamese Network with Interactive Transformer for Video Object Segmentation

lanmng/sitvos • • 28 Dec 2021

Semi-supervised video object segmentation (VOS) refers to segmenting the target object in remaining frames given its annotation in the first frame, which has been actively studied in recent years.

28 Dec 2021

Paper
Code

Reliable Propagation-Correction Modulation for Video Object Segmentation

jerryx1110/rpcmvos • • 6 Dec 2021

We introduce two modulators, propagation and correction modulators, to separately perform channel-wise re-calibration on the target frame embeddings according to local temporal correlations and reliable references respectively.

06 Dec 2021

Paper
Code

FAMINet: Learning Real-time Semi-supervised Video Object Segmentation with Steepest Optimized Optical Flow

liuziyang123/faminet • • 20 Nov 2021

A FAMINet, which consists of a feature extraction network (F), an appearance network (A), a motion network (M), and an integration network (I), is proposed in this study to address the abovementioned problem.

20 Nov 2021

Paper
Code

Dense Unsupervised Learning for Video Segmentation

visinf/dense-ulearn-vos • • NeurIPS 2021

On established VOS benchmarks, our approach exceeds the segmentation accuracy of previous work despite using significantly less training data and compute power.

182

11 Nov 2021

Paper
Code

Semi-Supervised Video Object Segmentation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result