TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Unsupervised Object Segmentation	DAVIS 2016	RCF (without Post-Processing)	J score	80.9	# 2
Unsupervised Object Segmentation	DAVIS 2016	RCF (with Post-Processing)	J score	83.0	# 1
Unsupervised Object Segmentation	FBMS-59	RCF (with post-processing)	mIoU	72.4	# 1
Unsupervised Object Segmentation	FBMS-59	RCF (without post-processing)	mIoU	69.9	# 2
Unsupervised Object Segmentation	SegTrack-v2	RCF (without post-processing)	mIoU	76.7	# 2
Unsupervised Object Segmentation	SegTrack-v2	RCF (with post-processing)	mIoU	79.6	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/bootstrapping-objectness-from-videos-by/unsupervised-object-segmentation-on-davis)](https://paperswithcode.com/sota/unsupervised-object-segmentation-on-davis?p=bootstrapping-objectness-from-videos-by)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/bootstrapping-objectness-from-videos-by/unsupervised-object-segmentation-on-fbms-59)](https://paperswithcode.com/sota/unsupervised-object-segmentation-on-fbms-59?p=bootstrapping-objectness-from-videos-by)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/bootstrapping-objectness-from-videos-by/unsupervised-object-segmentation-on-segtrack)](https://paperswithcode.com/sota/unsupervised-object-segmentation-on-segtrack?p=bootstrapping-objectness-from-videos-by)`

Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping

CVPR 2023 · Long Lian, Zhirong Wu, Stella X. Yu ·

We study learning object segmentation from unlabeled videos. Humans can easily segment moving objects without knowing what they are. The Gestalt law of common fate, i.e., what move at the same speed belong together, has inspired unsupervised object discovery based on motion segmentation. However, common fate is not a reliable indicator of objectness: Parts of an articulated / deformable object may not move at the same speed, whereas shadows / reflections of an object always move with it but are not part of it. Our insight is to bootstrap objectness by first learning image features from relaxed common fate and then refining them based on visual appearance grouping within the image itself and across images statistically. Specifically, we learn an image segmenter first in the loop of approximating optical flow with constant segment flow plus small within-segment residual flow, and then by refining it for more coherent appearance and statistical figure-ground relevance. On unsupervised video object segmentation, using only ResNet and convolutional heads, our model surpasses the state-of-the-art by absolute gains of 7/9/5% on DAVIS16 / STv2 / FBMS59 respectively, demonstrating the effectiveness of our ideas. Our code is publicly available.

PDF Abstract CVPR 2023 PDF CVPR 2023 Abstract

Code

Add Remove Mark official

TonyLianLong/RCF-UnsupVideoSeg official

Tasks

Add Remove

Motion Segmentation

Object

Object Discovery

Optical Flow Estimation

Segmentation

Semantic Segmentation

Unsupervised Object Segmentation

Unsupervised Video Object Segmentation

Video Object Segmentation

Video Semantic Segmentation

Datasets

DAVIS

DAVIS 2016

FBMS

SegTrack-v2

FBMS-59

Results from the Paper

Edit

Ranked #1 on Unsupervised Object Segmentation on FBMS-59

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Unsupervised Object Segmentation	DAVIS 2016	RCF (without Post-Processing)	J score	80.9	# 2	Compare
Unsupervised Object Segmentation	DAVIS 2016	RCF (with Post-Processing)	J score	83.0	# 1	Compare
Unsupervised Object Segmentation	FBMS-59	RCF (with post-processing)	mIoU	72.4	# 1	Compare
Unsupervised Object Segmentation	FBMS-59	RCF (without post-processing)	mIoU	69.9	# 2	Compare
Unsupervised Object Segmentation	SegTrack-v2	RCF (without post-processing)	mIoU	76.7	# 2	Compare
Unsupervised Object Segmentation	SegTrack-v2	RCF (with post-processing)	mIoU	79.6	# 1	Compare

Methods

Add Remove

1x1 Convolution • Average Pooling • Batch Normalization • Bottleneck Residual Block • Convolution • Global Average Pooling • Kaiming Initialization • Max Pooling • ReLU • Residual Block • Residual Connection • ResNet • SPEED

Edit Social Preview

Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove