TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
3D Object Detection	ScanNetV2	3DETR-m	mAP@0.25	65.0	# 18
3D Object Detection	ScanNetV2	3DETR-m	mAP@0.5	47.0	# 19
3D Object Detection	SUN-RGBD val	3DETR-m	mAP@0.25	59.1	# 20
3D Object Detection	SUN-RGBD val	3DETR-m	mAP@0.5	32.7	# 20

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/an-end-to-end-transformer-model-for-3d-object/3d-object-detection-on-scannetv2)](https://paperswithcode.com/sota/3d-object-detection-on-scannetv2?p=an-end-to-end-transformer-model-for-3d-object)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/an-end-to-end-transformer-model-for-3d-object/3d-object-detection-on-sun-rgbd-val)](https://paperswithcode.com/sota/3d-object-detection-on-sun-rgbd-val?p=an-end-to-end-transformer-model-for-3d-object)`

An End-to-End Transformer Model for 3D Object Detection

ICCV 2021 · Ishan Misra, Rohit Girdhar, Armand Joulin ·

We propose 3DETR, an end-to-end Transformer based object detection model for 3D point clouds. Compared to existing detection methods that employ a number of 3D-specific inductive biases, 3DETR requires minimal modifications to the vanilla Transformer block. Specifically, we find that a standard Transformer with non-parametric queries and Fourier positional embeddings is competitive with specialized architectures that employ libraries of 3D-specific operators with hand-tuned hyperparameters. Nevertheless, 3DETR is conceptually simple and easy to implement, enabling further improvements by incorporating 3D domain knowledge. Through extensive experiments, we show 3DETR outperforms the well-established and highly optimized VoteNet baselines on the challenging ScanNetV2 dataset by 9.5%. Furthermore, we show 3DETR is applicable to 3D tasks beyond detection, and can serve as a building block for future research.

PDF Abstract ICCV 2021 PDF ICCV 2021 Abstract

Code

Add Remove Mark official

facebookresearch/3detr

589

Tasks

Add Remove

3D Object Detection

object-detection

Object Detection

Datasets

ScanNet

SUN RGB-D

Results from the Paper

Add Remove

Ranked #18 on 3D Object Detection on ScanNetV2

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
3D Object Detection	ScanNetV2	3DETR-m	mAP@0.25	65.0	# 18	Compare
3D Object Detection	ScanNetV2	3DETR-m	mAP@0.5	47.0	# 19	Compare
3D Object Detection	SUN-RGBD val	3DETR-m	mAP@0.25	59.1	# 20	Compare
3D Object Detection	SUN-RGBD val	3DETR-m	mAP@0.5	32.7	# 20	Compare

Methods

Add Remove

Absolute Position Encodings • Adam • BPE • Dense Connections • Dropout • Label Smoothing • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer

Edit Social Preview

An End-to-End Transformer Model for 3D Object Detection

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove