TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Semi-Supervised Video Object Segmentation	DAVIS (no YouTube-VOS training)	DMM-Net	D17 val (G)	70.7	# 18
Semi-Supervised Video Object Segmentation	DAVIS (no YouTube-VOS training)	DMM-Net	D17 val (J)	68.1	# 18
Semi-Supervised Video Object Segmentation	DAVIS (no YouTube-VOS training)	DMM-Net	D17 val (F)	73.3	# 18

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/dmm-net-differentiable-mask-matching-network/semi-supervised-video-object-segmentation-on-20)](https://paperswithcode.com/sota/semi-supervised-video-object-segmentation-on-20?p=dmm-net-differentiable-mask-matching-network)`

DMM-Net: Differentiable Mask-Matching Network for Video Object Segmentation

ICCV 2019 · Xiaohui Zeng, Renjie Liao, Li Gu, Yuwen Xiong, Sanja Fidler, Raquel Urtasun ·

In this paper, we propose the differentiable mask-matching network (DMM-Net) for solving the video object segmentation problem where the initial object masks are provided. Relying on the Mask R-CNN backbone, we extract mask proposals per frame and formulate the matching between object templates and proposals at one time step as a linear assignment problem where the cost matrix is predicted by a CNN. We propose a differentiable matching layer by unrolling a projected gradient descent algorithm in which the projection exploits the Dykstra's algorithm. We prove that under mild conditions, the matching is guaranteed to converge to the optimum. In practice, it performs similarly to the Hungarian algorithm during inference. Meanwhile, we can back-propagate through it to learn the cost matrix. After matching, a refinement head is leveraged to improve the quality of the matched mask. Our DMM-Net achieves competitive results on the largest video object segmentation dataset YouTube-VOS. On DAVIS 2017, DMM-Net achieves the best performance without online learning on the first frames. Without any fine-tuning, DMM-Net performs comparably to state-of-the-art methods on SegTrack v2 dataset. At last, our matching layer is very simple to implement; we attach the PyTorch code ($<50$ lines) in the supplementary material. Our code is released at https://github.com/ZENGXH/DMM_Net.

PDF Abstract ICCV 2019 PDF ICCV 2019 Abstract

Code

Add Remove Mark official

ZENGXH/DMM_Net official

147

Tasks

Add Remove

Object

One-shot visual object segmentation

Rolling Shutter Correction

Semantic Segmentation

Semi-Supervised Video Object Segmentation

Video Object Segmentation

Video Semantic Segmentation

Datasets

MS COCO

DAVIS

DAVIS 2017

YouTube-VOS 2018

SegTrack-v2

Results from the Paper

Edit

Ranked #18 on Semi-Supervised Video Object Segmentation on DAVIS (no YouTube-VOS training)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Semi-Supervised Video Object Segmentation	DAVIS (no YouTube-VOS training)	DMM-Net	D17 val (G)	70.7	# 18	Compare
			D17 val (J)	68.1	# 18	Compare
			D17 val (F)	73.3	# 18	Compare

Methods

Add Remove

Convolution • Mask R-CNN • RoIAlign • RPN • Softmax

Edit Social Preview

DMM-Net: Differentiable Mask-Matching Network for Video Object Segmentation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove