Task	Dataset	Model	Metric Name	Metric Value	Global Rank
3D Open-Vocabulary Instance Segmentation	Replica	OpenMask3D	mAP	13.1	#3
3D Open-Vocabulary Instance Segmentation	ScanNet200	OpenMask3D	mAP	15.4	#2
3D Open-Vocabulary Instance Segmentation	ScanNet200	OpenMask3D	AP50	19.9	#2
3D Open-Vocabulary Instance Segmentation	ScanNet200	OpenMask3D	AP25	23.1	#2
3D Open-Vocabulary Instance Segmentation	ScanNet200	OpenMask3D	AP Head	17.1	#2
3D Open-Vocabulary Instance Segmentation	ScanNet200	OpenMask3D	AP Common	14.1	#2
3D Open-Vocabulary Instance Segmentation	ScanNet200	OpenMask3D	AP Tail	14.9	#2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/openmask3d-open-vocabulary-3d-instance/3d-open-vocabulary-instance-segmentation-on)](https://paperswithcode.com/sota/3d-open-vocabulary-instance-segmentation-on?p=openmask3d-open-vocabulary-3d-instance)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/openmask3d-open-vocabulary-3d-instance/3d-open-vocabulary-instance-segmentation-on-1)](https://paperswithcode.com/sota/3d-open-vocabulary-instance-segmentation-on-1?p=openmask3d-open-vocabulary-3d-instance)`

OpenMask3D: Open-Vocabulary 3D Instance Segmentation

NeurIPS 2023 · Ayça Takmaz, Elisabetta Fedele, Robert W. Sumner, Marc Pollefeys, Federico Tombari, Francis Engelmann

We introduce the task of open-vocabulary 3D instance segmentation. Current approaches for 3D instance segmentation can typically only recognize object categories from a pre-defined closed set of classes that are annotated in the training datasets. This results in important limitations for real-world applications where one might need to perform tasks guided by novel, open-vocabulary queries related to a wide variety of objects. Recently, open-vocabulary 3D scene understanding methods have emerged to address this problem by learning queryable features for each point in the scene. While such a representation can be directly employed to perform semantic segmentation, existing methods cannot separate multiple object instances. In this work, we address this limitation, and propose OpenMask3D, which is a zero-shot approach for open-vocabulary 3D instance segmentation. Guided by predicted class-agnostic 3D instance masks, our model aggregates per-mask features via multi-view fusion of CLIP-based image embeddings. Experiments and ablation studies on ScanNet200 and Replica show that OpenMask3D outperforms other open-vocabulary methods, especially on the long-tail distribution. Qualitative experiments further showcase OpenMask3D's ability to segment object properties based on free-form queries describing geometry, affordances, and materials.
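The per-mask aggregation and querying described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract says only that per-mask features come from "multi-view fusion of CLIP-based image embeddings", so the mean pooling used here is an assumption, the three-dimensional vectors are hand-written stand-ins for real CLIP embeddings, and the helper names (`fuse_mask_feature`, `rank_masks`) are hypothetical.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def fuse_mask_feature(view_embeddings):
    """Fuse one mask's per-view embeddings into a single unit-length feature.

    Assumption: fusion is a plain mean over normalized per-view embeddings.
    """
    if not view_embeddings:
        raise ValueError("need at least one view embedding")
    normalized = [l2_normalize(v) for v in view_embeddings]
    dim = len(normalized[0])
    mean = [sum(v[i] for v in normalized) / len(normalized) for i in range(dim)]
    return l2_normalize(mean)


def cosine_similarity(a, b):
    """Cosine similarity between two vectors."""
    a, b = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a, b))


def rank_masks(mask_features, query_embedding):
    """Return mask ids sorted by similarity to the query, most similar first."""
    scores = {
        mask_id: cosine_similarity(feature, query_embedding)
        for mask_id, feature in mask_features.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


# Toy data: two class-agnostic masks, each seen in two views.
# Real embeddings would come from an image encoder; these are placeholders.
views = {
    "mask_0": [[1.0, 0.0, 0.0], [0.9, 0.1, 0.0]],
    "mask_1": [[0.0, 1.0, 0.0], [0.1, 0.8, 0.1]],
}
mask_features = {mask_id: fuse_mask_feature(embs) for mask_id, embs in views.items()}

# Placeholder for the embedding of a free-form text query.
query = [0.0, 1.0, 0.0]
ranking = rank_masks(mask_features, query)
print(ranking)
```

Because each mask carries its own feature, a query returns whole object instances rather than per-point labels, which is the distinction the abstract draws against earlier per-point open-vocabulary methods.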


Code


OpenMask3D/openmask3d official

145

Tasks


3D Instance Segmentation

3D Open-Vocabulary Instance Segmentation

Instance Segmentation

Object

open vocabulary 3d instance segmentation

Scene Understanding

Segmentation

Semantic Segmentation

Datasets

ScanNet

Replica

ScanNet200

Results from the Paper


Ranked #2 on 3D Open-Vocabulary Instance Segmentation on ScanNet200


