3D Object Detection

590 papers with code • 55 benchmarks • 48 datasets

3D Object Detection is a task in computer vision where the goal is to identify and locate objects in a 3D environment based on their shape, location, and orientation. It involves detecting the presence of objects and determining their location in the 3D space in real-time. This task is crucial for applications such as autonomous vehicles, robotics, and augmented reality.

( Image credit: AVOD )

Benchmarks

Add a Result

These leaderboards are used to track progress in 3D Object Detection

Dataset	Best Model	Compare
KITTI Cars Moderate	GLENet-VR	See all
nuScenes	EA-LSS	See all
SUN-RGBD val	Point-GCC+TR3D+FF	See all
ScanNetV2	UDeerLvic	See all
KITTI Cars Easy	GLENet-VR	See all
KITTI Cars Hard	3D Dual-Fusion	See all
nuScenes Camera Only	Far3D	See all
KITTI Pedestrians Moderate	3D-FCT	See all
KITTI Cyclists Easy	3D-FCT	See all
KITTI Cyclists Moderate	3D-FCT	See all
KITTI Cyclists Hard	SA-Det3D	See all
KITTI Cars Easy val	SA-SSD+EBM	See all
KITTI Cars Moderate val	SA-SSD+EBM	See all
KITTI Cars Hard val	M3DeTR	See all
KITTI Pedestrians Easy	IPOD	See all
KITTI Pedestrians Hard	SVGA-Net	See all
DAIR-V2X-I	MonoUNI	See all
nuscenes Camera-Radar	HyDRa	See all
waymo vehicle	PillarNeXt	See all
Rope3D	MonoUNI	See all
SUN-RGBD	CAGroup3D (Geo Only)	See all
waymo cyclist	DSVT(val)	See all
waymo pedestrian	DSVT(val)	See all
S3DIS	Point-GCC+TR3D	See all
V2XSet	V2X-ViT	See all
nuScenes LiDAR only	DSVT	See all
OPV2V	V2VNet (PointPillar backbone)	See all
KITTI Pedestrian Easy val	PVCNN	See all
KITTI Pedestrian Moderate val	PVCNN	See all
KITTI Pedestrian Hard val	PVCNN	See all
KITTI Cyclist Easy val	PVCNN	See all
KITTI Cyclist Moderate val	PVCNN	See all
KITTI Cyclist Hard val	F-PointNet++ [Qi:2018fd]	See all
aiMotive Dataset	Lidar-Radar-Camera	See all
3D Object Detection on Argoverse2 Camera Only	Far3D	See all
waymo all_ns	CenterPoint	See all
NYU Depth v2	SGPN-CNN	See all
nuScenes-F	RRPN + R101 - F	See all
nuScenes-FB	RRPN + R101 - FB	See all
KITTI Pedestrian Hard	PiFeNet	See all
KITTI Cyclists Moderate val	Deformable PV-RCNN	See all
KITTI Pedestrians Moderate val	Deformable PV-RCNN	See all
Dense Fog	PV-RCNN	See all
KITTI Pedestrian Moderate	PiFeNet	See all
Heavy Snowfall	PV-RCNN	See all
Light Snowfall	PV-RCNN	See all
Clear Weather	PV-RCNN	See all
KITTI Pedestrian Easy	PiFeNet	See all
KITTI Pedestrian	PiFeNet	See all
V2X-SIM	Where2comm	See all
DAIR-V2X	CoBEVFlow	See all
Argoverse2	VoxelNeXt	See all
Cityscapes 3D	TaskPrompter	See all
Argoverse	ky_nctu_mo	See all
IRV2V	CoBEVFlow	See all

Show all 55 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find 3D Object Detection models and implementations

open-mmlab/mmdetection3d

14 papers

4,842

PaddlePaddle/Paddle3D

6 papers

541

open-mmlab/OpenPCDet

5 papers

4,342

DerrickXuNu/OpenCOOD

5 papers

601

See all 12 libraries.

Datasets

Subtasks

Robust 3D Object Detection

Robust BEV Detection

Latest papers

Most implemented Social Latest No code

CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view Images

sercharles/cn-rma • • 7 Mar 2024

This paper introduces CN-RMA, a novel approach for 3D indoor object detection from multi-view images.

07 Mar 2024

Paper
Code

Scalable Vision-Based 3D Object Detection and Monocular Depth Estimation for Autonomous Driving

owen-liuyuxuan/visionfactory • • 4 Mar 2024

Collectively, these contributions lay a robust foundation for the widespread adoption of vision-based 3D perception technologies in autonomous driving applications.

04 Mar 2024

Paper
Code

Leveraging Anchor-based LiDAR 3D Object Detection via Point Assisted Sample Selection

xjtu-haolin/point_assisted_sample_selection • • 4 Mar 2024

3D object detection based on LiDAR point cloud and prior anchor boxes is a critical technology for autonomous driving environment perception and understanding.

04 Mar 2024

Paper
Code

TUMTraf V2X Cooperative Perception Dataset

walzimmer/3d-bat • 2 Mar 2024

We propose CoopDet3D, a cooperative multi-modal fusion model, and TUMTraf-V2X, a perception dataset, for the cooperative 3D object detection and tracking task.

576

02 Mar 2024

Paper
Code

Sunshine to Rainstorm: Cross-Weather Knowledge Distillation for Robust 3D Object Detection

ylwhxht/SRKD-DRET • • 28 Feb 2024

LiDAR-based 3D object detection models have traditionally struggled under rainy conditions due to the degraded and noisy scanning signals.

28 Feb 2024

Paper
Code

EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection

bosszhe/vimi • • 23 Feb 2024

In autonomous driving, cooperative perception makes use of multi-view cameras from both vehicles and infrastructure, providing a global vantage point with rich semantic context of road conditions beyond a single vehicle viewpoint.

23 Feb 2024

Paper
Code

MultiCorrupt: A Multi-Modal Robustness Dataset and Benchmark of LiDAR-Camera Fusion for 3D Object Detection

ika-rwth-aachen/multicorrupt • 18 Feb 2024

Multi-modal 3D object detection models for automated driving have demonstrated exceptional performance on computer vision benchmarks like nuScenes.

18 Feb 2024

Paper
Code

AYDIV: Adaptable Yielding 3D Object Detection via Integrated Contextual Vision Transformer

sanjay-810/aydiv2 • • 12 Feb 2024

Combining LiDAR and camera data has shown potential in enhancing short-distance object detection in autonomous driving systems.

12 Feb 2024

Paper
Code

Ray Denoising: Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection

liewfeng/beam • • 6 Feb 2024

Multi-view 3D object detection systems often struggle with generating precise predictions due to the challenges in estimating depth from images, increasing redundant and incorrect detections.

06 Feb 2024

Paper
Code

ActiveAnno3D - An Active Learning Framework for Multi-Modal 3D Object Detection

walzimmer/3d-bat • 5 Feb 2024

We propose ActiveAnno3D, an active learning framework to select data samples for labeling that are of maximum informativeness for training.

576

05 Feb 2024

Paper
Code

3D Object Detection

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result