Motion-inductive Self-supervised Object Discovery in Videos

1 Oct 2022 · Shuangrui Ding, Weidi Xie, Yabo Chen, Rui Qian, Xiaopeng Zhang, Hongkai Xiong, Qi Tian

In this paper, we consider the task of unsupervised object discovery in videos. Previous works have shown promising results by processing optical flow to segment objects. However, taking flow as input has two drawbacks. First, flow cannot capture sufficient cues when objects remain static or are partially occluded. Second, establishing temporal coherence from flow-only input is challenging due to the missing texture information. To tackle these limitations, we propose a model that directly processes consecutive RGB frames and infers the optical flow between any pair of frames using a layered representation, with the opacity channels treated as the segmentation. Additionally, to enforce object permanence, we apply a temporal consistency loss on the masks inferred from randomly paired frames, which capture motion at different paces, encouraging the model to segment objects even when they do not move at the current time point. Experimentally, we demonstrate superior performance over previous state-of-the-art methods on three public video segmentation datasets (DAVIS2016, SegTrackv2, and FBMS-59), while remaining computationally efficient by avoiding the overhead of computing optical flow as input.
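For intuition, below is a minimal sketch of the temporal consistency idea described in the abstract: predict opacity masks for the same reference frame from two frame pairs with different temporal gaps, and penalize disagreement between them. The `model` interface, its `(flow, alpha)` output convention, and the MSE loss form are illustrative assumptions, not the paper's exact implementation.

```python
# Sketch of a temporal consistency loss over randomly paired frames
# (hypothetical module interface; the actual MOD architecture differs).
import torch
import torch.nn.functional as F

def temporal_consistency_loss(model, frames):
    """frames: (B, T, 3, H, W) clip of consecutive RGB frames.

    Sample two frame pairs sharing a reference frame but with different
    temporal gaps ("paces"), predict the layered opacity masks for the
    reference frame from each pair, and penalize disagreement so that
    objects stay segmented even when they are currently static.
    """
    B, T = frames.shape[:2]
    ref = 0                                   # shared reference frame
    gaps = torch.randperm(T - 1)[:2] + 1      # two distinct paces (T >= 3)
    masks = []
    for g in gaps:
        pair = torch.stack([frames[:, ref], frames[:, g]], dim=1)
        # `model` is assumed to return (flow, alpha) for a frame pair,
        # where alpha (B, L, H, W) holds the opacity channels of the
        # layered representation, used as the segmentation.
        _, alpha = model(pair)
        masks.append(alpha)
    # Masks of the same reference frame should agree regardless of pace.
    return F.mse_loss(masks[0], masks[1])
```

This captures the stated motivation: because the two pairs see motion at different speeds, the masks cannot rely on instantaneous flow alone, which encourages segmenting objects that are momentarily static.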

Results

Task                              Dataset      Model  Metric   Value  Global Rank
Unsupervised Object Segmentation  DAVIS 2016   MOD    J score  73.9   #3
Unsupervised Object Segmentation  FBMS-59      MOD    mIoU     61.3   #4
Unsupervised Object Segmentation  SegTrack-v2  MOD    mIoU     62.2   #5
