TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Semi-Supervised Video Object Segmentation	DAVIS 2016	CINM	Jaccard (Mean)	83.4	# 55
Semi-Supervised Video Object Segmentation	DAVIS 2016	CINM	Jaccard (Recall)	94.9	# 15
Semi-Supervised Video Object Segmentation	DAVIS 2016	CINM	Jaccard (Decay)	12.3	# 8
Semi-Supervised Video Object Segmentation	DAVIS 2016	CINM	F-measure (Mean)	85.0	# 51
Semi-Supervised Video Object Segmentation	DAVIS 2016	CINM	F-measure (Recall)	92.1	# 14
Semi-Supervised Video Object Segmentation	DAVIS 2016	CINM	F-measure (Decay)	14.7	# 5
Semi-Supervised Video Object Segmentation	DAVIS 2016	CINM	J&F	84.2	# 52
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	CINM	J&F	67.5	# 41
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	CINM	Jaccard (Mean)	64.5	# 41
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	CINM	Jaccard (Recall)	73.8	# 7
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	CINM	Jaccard (Decay)	20.0	# 10
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	CINM	F-measure (Mean)	70.5	# 42
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	CINM	F-measure (Recall)	79.6	# 8
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	CINM	F-measure (Decay)	20.0	# 10
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	CINM	Jaccard (Mean)	67.2	# 57
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	CINM	Jaccard (Recall)	74.5	# 13
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	CINM	Jaccard (Decay)	24.6	# 20
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	CINM	F-measure (Mean)	74.0	# 56
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	CINM	F-measure (Recall)	81.6	# 12
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	CINM	F-measure (Decay)	26.2	# 18
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	CINM	J&F	70.6	# 58
Semi-Supervised Video Object Segmentation	YouTube	MRFCNN	mIoU	0.784	# 3

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cnn-in-mrf-video-object-segmentation-via/video-object-segmentation-on-youtube)](https://paperswithcode.com/sota/video-object-segmentation-on-youtube?p=cnn-in-mrf-video-object-segmentation-via)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cnn-in-mrf-video-object-segmentation-via/semi-supervised-video-object-segmentation-on-1)](https://paperswithcode.com/sota/semi-supervised-video-object-segmentation-on-1?p=cnn-in-mrf-video-object-segmentation-via)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cnn-in-mrf-video-object-segmentation-via/visual-object-tracking-on-davis-2016)](https://paperswithcode.com/sota/visual-object-tracking-on-davis-2016?p=cnn-in-mrf-video-object-segmentation-via)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cnn-in-mrf-video-object-segmentation-via/visual-object-tracking-on-davis-2017)](https://paperswithcode.com/sota/visual-object-tracking-on-davis-2017?p=cnn-in-mrf-video-object-segmentation-via)`

CNN in MRF: Video Object Segmentation via Inference in A CNN-Based Higher-Order Spatio-Temporal MRF

CVPR 2018 · Linchao Bao, Baoyuan Wu, Wei Liu ·

This paper addresses the problem of video object segmentation, where the initial object mask is given in the first frame of an input video. We propose a novel spatio-temporal Markov Random Field (MRF) model defined over pixels to handle this problem. Unlike conventional MRF models, the spatial dependencies among pixels in our model are encoded by a Convolutional Neural Network (CNN). Specifically, for a given object, the probability of a labeling to a set of spatially neighboring pixels can be predicted by a CNN trained for this specific object. As a result, higher-order, richer dependencies among pixels in the set can be implicitly modeled by the CNN. With temporal dependencies established by optical flow, the resulting MRF model combines both spatial and temporal cues for tackling video object segmentation. However, performing inference in the MRF model is very difficult due to the very high-order dependencies. To this end, we propose a novel CNN-embedded algorithm to perform approximate inference in the MRF. This algorithm proceeds by alternating between a temporal fusion step and a feed-forward CNN step. When initialized with an appearance-based one-shot segmentation CNN, our model outperforms the winning entries of the DAVIS 2017 Challenge, without resorting to model ensembling or any dedicated detectors.

PDF Abstract CVPR 2018 PDF CVPR 2018 Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Object

One-Shot Segmentation

Optical Flow Estimation

Segmentation

Semantic Segmentation

Semi-Supervised Video Object Segmentation

Video Object Segmentation

Video Semantic Segmentation

Datasets

DAVIS

DAVIS 2017

DAVIS 2016

SegTrack-v2

Referring Expressions for DAVIS 2016 & 2017

Results from the Paper

Edit

Ranked #3 on Semi-Supervised Video Object Segmentation on YouTube

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Semi-Supervised Video Object Segmentation	DAVIS 2016	CINM	Jaccard (Mean)	83.4	# 55	Compare
			Jaccard (Recall)	94.9	# 15	Compare
			Jaccard (Decay)	12.3	# 8	Compare
			F-measure (Mean)	85.0	# 51	Compare
			F-measure (Recall)	92.1	# 14	Compare
			F-measure (Decay)	14.7	# 5	Compare
			J&F	84.2	# 52	Compare
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	CINM	J&F	67.5	# 41	Compare
			Jaccard (Mean)	64.5	# 41	Compare
			Jaccard (Recall)	73.8	# 7	Compare
			Jaccard (Decay)	20.0	# 10	Compare
			F-measure (Mean)	70.5	# 42	Compare
			F-measure (Recall)	79.6	# 8	Compare
			F-measure (Decay)	20.0	# 10	Compare
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	CINM	Jaccard (Mean)	67.2	# 57	Compare
			Jaccard (Recall)	74.5	# 13	Compare
			Jaccard (Decay)	24.6	# 20	Compare
			F-measure (Mean)	74.0	# 56	Compare
			F-measure (Recall)	81.6	# 12	Compare
			F-measure (Decay)	26.2	# 18	Compare
			J&F	70.6	# 58	Compare
Semi-Supervised Video Object Segmentation	YouTube	MRFCNN	mIoU	0.784	# 3	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

CNN in MRF: Video Object Segmentation via Inference in A CNN-Based Higher-Order Spatio-Temporal MRF

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove