SWEM: Towards Real-Time Video Object Segmentation with Sequential Weighted Expectation-Maximization

Matching-based methods, especially those based on space-time memory, are significantly ahead of other solutions in semi-supervised video object segmentation (VOS). However, continuously growing and redundant template features lead to inefficient inference. To alleviate this, we propose a novel Sequential Weighted Expectation-Maximization (SWEM) network to greatly reduce the redundancy of memory features. Unlike previous methods, which only detect feature redundancy between frames, SWEM merges similar features both within and across frames by leveraging the sequential weighted EM algorithm. Further, adaptive weights for frame features give SWEM the flexibility to represent hard samples, improving the discriminative power of the templates. In addition, the proposed method maintains a fixed number of template features in memory, which ensures stable inference complexity of the VOS system. Extensive experiments on the commonly used DAVIS and YouTube-VOS datasets verify the high efficiency (36 FPS) and high performance (84.3\% $\mathcal{J}\&\mathcal{F}$ on the DAVIS 2017 validation set) of SWEM. Code is available at: https://github.com/lmm077/SWEM.

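The core idea, compressing a growing pool of frame features into a fixed number of weighted memory bases with an EM-style loop, can be illustrated with a short sketch. The snippet below is not the authors' implementation: the basis count K, the softmax temperature tau, the number of EM iterations, and the way accumulated weights are carried across frames are illustrative assumptions, and the real SWEM network operates on learned per-object feature maps rather than random tensors.

```python
# Illustrative sketch (not the authors' code) of a sequential weighted EM step
# that merges per-frame features into a fixed-size set of memory bases.
import torch
import torch.nn.functional as F

def weighted_em(features, weights, bases, n_iters=4, tau=0.05):
    """One weighted EM pass.

    features: (N, C) candidate features to be merged
    weights:  (N,)   per-feature weights (e.g., larger for hard samples)
    bases:    (K, C) current memory bases (fixed-size template)
    Returns updated bases (K, C) and per-basis accumulated weight (K,).
    """
    for _ in range(n_iters):
        # E-step: soft-assign each feature to the bases by similarity.
        sim = features @ bases.t()                                # (N, K)
        resp = F.softmax(sim / tau, dim=1)                        # responsibilities
        # M-step: weighted average of the assigned features per basis.
        w_resp = resp * weights.unsqueeze(1)                      # (N, K)
        denom = w_resp.sum(dim=0, keepdim=True).clamp_min(1e-6)   # (1, K)
        bases = (w_resp.t() @ features) / denom.t()               # (K, C)
        bases = F.normalize(bases, dim=1)
    return bases, w_resp.sum(dim=0)

def sequential_update(bases, base_w, new_feats, new_w, **em_kwargs):
    """Absorb a new frame's features into the fixed-size bases.

    The old bases re-enter the EM step as weighted samples themselves,
    so the memory never grows beyond K vectors.
    """
    feats = torch.cat([bases, new_feats], dim=0)                  # (K + N, C)
    weights = torch.cat([base_w, new_w], dim=0)                   # (K + N,)
    return weighted_em(feats, weights, bases, **em_kwargs)

# Toy usage: compress 1024 pixel features into 128 bases, then add a frame.
C, K = 64, 128
bases = F.normalize(torch.randn(K, C), dim=1)
base_w = torch.ones(K)
frame = F.normalize(torch.randn(1024, C), dim=1)
bases, base_w = sequential_update(bases, base_w, frame, torch.ones(1024))
print(bases.shape, base_w.shape)  # torch.Size([128, 64]) torch.Size([128])
```

Because each new frame is folded into the same K bases rather than appended to memory, the matching cost stays constant over long videos, which mirrors the fixed-size template and stable inference complexity described in the abstract.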
CVPR 2022 | PDF | Abstract
Benchmark results (Task: Semi-Supervised Video Object Segmentation, Model: SWEM)

DAVIS 2016 (val)
  Jaccard (Mean): 87.3 (global rank #40)
  F-measure (Mean): 89.0 (global rank #40)
  J&F: 88.1 (global rank #41)
  Speed (FPS): 36 (global rank #12)

DAVIS 2017 (val)
  Jaccard (Mean): 74.5 (global rank #45)
  F-measure (Mean): 79.8 (global rank #49)
  J&F: 77.2 (global rank #49)

DAVIS (no YouTube-VOS training)
  FPS: 36.0 (global rank #5)
  D16 val (G): 88.1 (global rank #2)
  D16 val (J): 87.3 (global rank #4)
  D16 val (F): 89.0 (global rank #2)
  D17 val (G): 77.2 (global rank #6)
  D17 val (J): 74.5 (global rank #6)
  D17 val (F): 79.8 (global rank #6)

MOSE
  J&F: 50.9 (global rank #14)
  J: 46.8 (global rank #14)
  F: 54.9 (global rank #15)
