TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Unsupervised Video Object Segmentation	DAVIS 2017 (val)	MATNet	J&F	58.6	# 6
Unsupervised Video Object Segmentation	DAVIS 2017 (val)	MATNet	Jaccard (Mean)	56.7	# 6
Unsupervised Video Object Segmentation	DAVIS 2017 (val)	MATNet	Jaccard (Recall)	65.2	# 4
Unsupervised Video Object Segmentation	DAVIS 2017 (val)	MATNet	F-measure (Mean)	60.4	# 6
Unsupervised Video Object Segmentation	DAVIS 2017 (val)	MATNet	F-measure (Recall)	68.2	# 4
Video Polyp Segmentation	SUN-SEG-Easy (Unseen)	MAT	S measure	0.770	# 5
Video Polyp Segmentation	SUN-SEG-Easy (Unseen)	MAT	mean E-measure	0.737	# 8
Video Polyp Segmentation	SUN-SEG-Easy (Unseen)	MAT	weighted F-measure	0.575	# 6
Video Polyp Segmentation	SUN-SEG-Easy (Unseen)	MAT	mean F-measure	0.641	# 6
Video Polyp Segmentation	SUN-SEG-Easy (Unseen)	MAT	Dice	0.710	# 5
Video Polyp Segmentation	SUN-SEG-Easy (Unseen)	MAT	Sensitivity	0.542	# 6
Video Polyp Segmentation	SUN-SEG-Hard (Unseen)	MAT	S-Measure	0.785	# 4
Video Polyp Segmentation	SUN-SEG-Hard (Unseen)	MAT	mean E-measure	0.755	# 5
Video Polyp Segmentation	SUN-SEG-Hard (Unseen)	MAT	weighted F-measure	0.578	# 6
Video Polyp Segmentation	SUN-SEG-Hard (Unseen)	MAT	mean F-measure	0.645	# 6
Video Polyp Segmentation	SUN-SEG-Hard (Unseen)	MAT	Dice	0.712	# 3
Video Polyp Segmentation	SUN-SEG-Hard (Unseen)	MAT	Sensitivity	0.579	# 5

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/matnet-motion-attentive-transition-network/video-polyp-segmentation-on-sun-seg-hard)](https://paperswithcode.com/sota/video-polyp-segmentation-on-sun-seg-hard?p=matnet-motion-attentive-transition-network)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/matnet-motion-attentive-transition-network/video-polyp-segmentation-on-sun-seg-easy)](https://paperswithcode.com/sota/video-polyp-segmentation-on-sun-seg-easy?p=matnet-motion-attentive-transition-network)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/matnet-motion-attentive-transition-network/unsupervised-video-object-segmentation-on-4)](https://paperswithcode.com/sota/unsupervised-video-object-segmentation-on-4?p=matnet-motion-attentive-transition-network)`

MATNet: Motion-Attentive Transition Network for Zero-Shot Video Object Segmentation

IEEE Transactions on Image Processing 2020 · Zhou, Tianfei; Li, Jianwu; Wang, Shunzhou; Tao, Ran; Shen, Jianbing ·

In this paper, we present a novel end-to-end learning neural network, i.e., MATNet, for zero-shot video object segmentation (ZVOS). Motivated by the human visual attention behavior, MATNet leverages motion cues as a bottom-up signal to guide the perception of object appearance. To achieve this, an asymmetric attention block, named Motion-Attentive Transition (MAT), is proposed within a two-stream encoder network to firstly identify moving regions and then attend appearance learning to capture the full extent of objects. Putting MATs in different convolutional layers, our encoder becomes deeply interleaved, allowing for close hierarchical interactions between object apperance and motion. Such a biologically-inspired design is proven to be superb to conventional two-stream structures, which treat motion and appearance independently in separate streams and often suffer severe overfitting to object appearance. Moreover, we introduce a bridge network to modulate multi-scale spatiotemporal features into more compact, discriminative and scale-sensitive representations, which are subsequently fed into a boundary-aware decoder network to produce accurate segmentation with crisp boundaries. We perform extensive quantitative and qualitative experiments on four challenging public benchmarks, i.e., DAVIS16, DAVIS17, FBMS and YouTube-Objects. Results show that our method achieves compelling performance against current state-of-the-art ZVOS methods. To further demonstrate the generalization ability of our spatiotemporal learning framework, we extend MATNet to another relevant task: dynamic visual attention prediction (DVAP). The experiments on two popular datasets (i.e., Hollywood-2 and UCF-Sports) further verify the superiority of our model.

PDF

Code

Add Remove Mark official

tfzhou/MATNet

190

Tasks

Add Remove

Object

Semantic Segmentation

Unsupervised Video Object Segmentation

Video Object Segmentation

Video Polyp Segmentation

Video Semantic Segmentation

Zero-Shot Video Object Segmentation

Datasets

DAVIS 2017

Referring Expressions for DAVIS 2016 & 2017 SUN-SEG-Hard (Unseen) SUN-SEG-Easy (Unseen)

Results from the Paper

Add Remove

Ranked #4 on Video Polyp Segmentation on SUN-SEG-Hard (Unseen)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Unsupervised Video Object Segmentation	DAVIS 2017 (val)	MATNet	J&F	58.6	# 6	Compare
			Jaccard (Mean)	56.7	# 6	Compare
			Jaccard (Recall)	65.2	# 4	Compare
			F-measure (Mean)	60.4	# 6	Compare
			F-measure (Recall)	68.2	# 4	Compare
Video Polyp Segmentation	SUN-SEG-Easy (Unseen)	MAT	S measure	0.770	# 5	Compare
			mean E-measure	0.737	# 8	Compare
			weighted F-measure	0.575	# 6	Compare
			mean F-measure	0.641	# 6	Compare
			Dice	0.710	# 5	Compare
			Sensitivity	0.542	# 6	Compare
Video Polyp Segmentation	SUN-SEG-Hard (Unseen)	MAT	S-Measure	0.785	# 4	Compare
			mean E-measure	0.755	# 5	Compare
			weighted F-measure	0.578	# 6	Compare
			mean F-measure	0.645	# 6	Compare
			Dice	0.712	# 3	Compare
			Sensitivity	0.579	# 5	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

MATNet: Motion-Attentive Transition Network for Zero-Shot Video Object Segmentation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove