TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Unsupervised Video Object Segmentation	DAVIS 2016 val	AGS	G	78.6	# 21
Unsupervised Video Object Segmentation	DAVIS 2016 val	AGS	J	79.7	# 21
Unsupervised Video Object Segmentation	DAVIS 2016 val	AGS	F	77.4	# 20
Unsupervised Video Object Segmentation	DAVIS 2017 (test-dev)	AGS	J&F	45.6	# 3
Unsupervised Video Object Segmentation	DAVIS 2017 (test-dev)	AGS	Jaccard (Mean)	42.1	# 2
Unsupervised Video Object Segmentation	DAVIS 2017 (test-dev)	AGS	Jaccard (Recall)	48.5	# 2
Unsupervised Video Object Segmentation	DAVIS 2017 (test-dev)	AGS	Jaccard (Decay)	2.6	# 2
Unsupervised Video Object Segmentation	DAVIS 2017 (test-dev)	AGS	F-measure (Mean)	49.0	# 2
Unsupervised Video Object Segmentation	DAVIS 2017 (test-dev)	AGS	F-measure (Recall)	51.5	# 2
Unsupervised Video Object Segmentation	DAVIS 2017 (test-dev)	AGS	F-measure (Decay)	2.6	# 2
Unsupervised Video Object Segmentation	DAVIS 2017 (val)	AGS	J&F	57.5	# 8
Unsupervised Video Object Segmentation	DAVIS 2017 (val)	AGS	Jaccard (Mean)	55.5	# 8
Unsupervised Video Object Segmentation	DAVIS 2017 (val)	AGS	Jaccard (Recall)	61.6	# 6
Unsupervised Video Object Segmentation	DAVIS 2017 (val)	AGS	F-measure (Mean)	59.5	# 8
Unsupervised Video Object Segmentation	DAVIS 2017 (val)	AGS	F-measure (Recall)	62.8	# 6
Unsupervised Video Object Segmentation	YouTube-Objects	AGS	J	69.7	# 8

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-unsupervised-video-object/unsupervised-video-object-segmentation-on-5)](https://paperswithcode.com/sota/unsupervised-video-object-segmentation-on-5?p=learning-unsupervised-video-object)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-unsupervised-video-object/unsupervised-video-object-segmentation-on-4)](https://paperswithcode.com/sota/unsupervised-video-object-segmentation-on-4?p=learning-unsupervised-video-object)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-unsupervised-video-object/unsupervised-video-object-segmentation-on-12)](https://paperswithcode.com/sota/unsupervised-video-object-segmentation-on-12?p=learning-unsupervised-video-object)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-unsupervised-video-object/unsupervised-video-object-segmentation-on-10)](https://paperswithcode.com/sota/unsupervised-video-object-segmentation-on-10?p=learning-unsupervised-video-object)`

Learning Unsupervised Video Object Segmentation Through Visual Attention

CVPR 2019 · Wenguan Wang, Hongmei Song, Shuyang Zhao, Jianbing Shen, Sanyuan Zhao, Steven C. H. Hoi, Haibin Ling ·

This paper conducts a systematic study on the role of visual attention in Unsupervised Video Object Segmentation (UVOS) tasks. By elaborately annotating three popular video segmentation datasets (DAVIS, Youtube-Objects and SegTrack V2) with dynamic eye-tracking data in the UVOS setting, for the first time, we quantitatively verified the high consistency of visual attention behavior among human observers, and found strong correlation between human attention and explicit primary object judgements during dynamic, task-driven viewing. Such novel observations provide an in-depth insight into the underlying rationale behind UVOS. Inspired by these findings, we decouple UVOS into two sub-tasks: UVOS-driven Dynamic Visual Attention Prediction (DVAP) in spatiotemporal domain, and Attention-Guided Object Segmentation (AGOS) in spatial domain. Our UVOS solution enjoys three major merits: 1) modular training without using expensive video segmentation annotations, instead, using more affordable dynamic fixation data to train the initial video attention module and using existing fixation-segmentation paired static/image data to train the subsequent segmentation module; 2) comprehensive foreground understanding through multi-source learning; and 3) additional interpretability from the biologically-inspired and assessable attention. Experiments on popular benchmarks show that, even without using expensive video object mask annotations, our model achieves compelling performance in comparison with state-of-the-arts.

PDF Abstract

Code

Add Remove Mark official

wenguanwang/AGS official

209

Tasks

Add Remove

Object

Segmentation

Semantic Segmentation

Unsupervised Video Object Segmentation

Video Object Segmentation

Video Segmentation

Video Semantic Segmentation

Datasets

DAVIS

DAVIS 2017

DAVIS 2016

Referring Expressions for DAVIS 2016 & 2017

Results from the Paper

Add Remove

Ranked #3 on Unsupervised Video Object Segmentation on DAVIS 2017 (test-dev)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Unsupervised Video Object Segmentation	DAVIS 2016 val	AGS	G	78.6	# 21	Compare
			J	79.7	# 21	Compare
			F	77.4	# 20	Compare
Unsupervised Video Object Segmentation	DAVIS 2017 (test-dev)	AGS	J&F	45.6	# 3	Compare
			Jaccard (Mean)	42.1	# 2	Compare
			Jaccard (Recall)	48.5	# 2	Compare
			Jaccard (Decay)	2.6	# 2	Compare
			F-measure (Mean)	49.0	# 2	Compare
			F-measure (Recall)	51.5	# 2	Compare
			F-measure (Decay)	2.6	# 2	Compare
Unsupervised Video Object Segmentation	DAVIS 2017 (val)	AGS	J&F	57.5	# 8	Compare
			Jaccard (Mean)	55.5	# 8	Compare
			Jaccard (Recall)	61.6	# 6	Compare
			F-measure (Mean)	59.5	# 8	Compare
			F-measure (Recall)	62.8	# 6	Compare
Unsupervised Video Object Segmentation	YouTube-Objects	AGS	J	69.7	# 8	Compare

Methods

Add Remove

Interpretability

Edit Social Preview

Learning Unsupervised Video Object Segmentation Through Visual Attention

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove