Video Semantic Segmentation

321 papers with code • 5 benchmarks • 8 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in Video Semantic Segmentation

Dataset	Best Model	Compare
Cityscapes val	TMANet-50	See all
CamVid	TMANet-50	See all
VSPW	DVIS++(VIT-L)	See all
LaRS	WaSR-T (ResNet-101)	See all
Multispectral Video Semantic Segmentation	MVNet(DeepLabV3)	See all

Libraries

Use these libraries to find Video Semantic Segmentation models and implementations

yoxu515/aot-benchmark

4 papers

560

PaddlePaddle/PaddleSeg

3 papers

8,226

visionml/pytracking

3 papers

3,080

hkchengrex/Mask-Propagation

3 papers

124

See all 9 libraries.

Datasets

Subtasks

Camera shot segmentation

Most implemented papers

Most implemented Social Latest No code

Pyramid Scene Parsing Network

hszhao/PSPNet • • CVPR 2017

Scene parsing is challenging for unrestricted open vocabulary and diverse scenes.

Paper
Code

Fully Convolutional Networks for Semantic Segmentation

pytorch/vision • • CVPR 2015

Convolutional networks are powerful visual models that yield hierarchies of features.

Paper
Code

PReMVOS: Proposal-generation, Refinement and Merging for Video Object Segmentation

JonathonLuiten/PReMVOS • • 24 Jul 2018

We address semi-supervised video object segmentation, the task of automatically generating accurate and consistent pixel masks for objects in a video sequence, given the first-frame ground truth annotations.

Paper
Code

Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion

hkchengrex/MiVOS • • CVPR 2021

We present Modular interactive VOS (MiVOS) framework which decouples interaction-to-mask and mask propagation, allowing for higher generalizability and better performance.

Paper
Code

Rethinking Self-supervised Correspondence Learning: A Video Frame-level Similarity Perspective

xvjiarui/VFS • • ICCV 2021

To learn generalizable representation for correspondence in large-scale, a variety of self-supervised pretext tasks are proposed to explicitly perform object-level or patch-level similarity learning.

Paper
Code

Mask2Former for Video Instance Segmentation

facebookresearch/Mask2Former • • 20 Dec 2021

We find Mask2Former also achieves state-of-the-art performance on video instance segmentation without modifying the architecture, the loss or even the training pipeline.

Paper
Code

Lucid Data Dreaming for Video Object Segmentation

omkar13/MaskTrack • • 28 Mar 2017

Our approach is suitable for both single and multiple object segmentation.

Paper
Code

YouTube-VOS: Sequence-to-Sequence Video Object Segmentation

BehradToghi/ECCV_Youtube_VOS • • ECCV 2018

End-to-end sequential learning to explore spatial-temporal features for video segmentation is largely limited by the scale of available video segmentation datasets, i. e., even the largest video segmentation dataset only contains 90 short video clips.

Paper
Code

CCNet: Criss-Cross Attention for Semantic Segmentation

speedinghzl/CCNet • • ICCV 2019

Compared with the non-local block, the proposed recurrent criss-cross attention module requires 11x less GPU memory usage.

Paper
Code

Interactive Video Object Segmentation Using Global and Local Transfer Modules

yuk6heo/IVOS-ATNet • • ECCV 2020

The global transfer module conveys the segmentation information in an annotated frame to a target frame, while the local transfer module propagates the segmentation information in a temporally adjacent frame to the target frame.

Paper
Code

Video Semantic Segmentation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result