3D Object Detection

585 papers with code • 55 benchmarks • 48 datasets

3D Object Detection is a task in computer vision where the goal is to identify and locate objects in a 3D environment based on their shape, location, and orientation. It involves detecting the presence of objects and determining their location in the 3D space in real-time. This task is crucial for applications such as autonomous vehicles, robotics, and augmented reality.

( Image credit: AVOD )

Benchmarks

Add a Result

These leaderboards are used to track progress in 3D Object Detection

Dataset	Best Model	Compare
KITTI Cars Moderate	GLENet-VR	See all
nuScenes	EA-LSS	See all
SUN-RGBD val	Point-GCC+TR3D+FF	See all
ScanNetV2	UDeerLvic	See all
KITTI Cars Easy	GLENet-VR	See all
KITTI Cars Hard	3D Dual-Fusion	See all
nuScenes Camera Only	Far3D	See all
KITTI Pedestrians Moderate	3D-FCT	See all
KITTI Cyclists Easy	3D-FCT	See all
KITTI Cyclists Moderate	3D-FCT	See all
KITTI Cyclists Hard	SA-Det3D	See all
KITTI Cars Easy val	SA-SSD+EBM	See all
KITTI Cars Moderate val	SA-SSD+EBM	See all
KITTI Cars Hard val	M3DeTR	See all
KITTI Pedestrians Easy	IPOD	See all
KITTI Pedestrians Hard	SVGA-Net	See all
DAIR-V2X-I	MonoUNI	See all
nuscenes Camera-Radar	HyDRa	See all
waymo vehicle	PillarNeXt	See all
Rope3D	MonoUNI	See all
SUN-RGBD	CAGroup3D (Geo Only)	See all
waymo cyclist	DSVT(val)	See all
waymo pedestrian	DSVT(val)	See all
S3DIS	Point-GCC+TR3D	See all
V2XSet	V2X-ViT	See all
nuScenes LiDAR only	DSVT	See all
OPV2V	V2VNet (PointPillar backbone)	See all
KITTI Pedestrian Easy val	PVCNN	See all
KITTI Pedestrian Moderate val	PVCNN	See all
KITTI Pedestrian Hard val	PVCNN	See all
KITTI Cyclist Easy val	PVCNN	See all
KITTI Cyclist Moderate val	PVCNN	See all
KITTI Cyclist Hard val	F-PointNet++ [Qi:2018fd]	See all
aiMotive Dataset	Lidar-Radar-Camera	See all
3D Object Detection on Argoverse2 Camera Only	Far3D	See all
waymo all_ns	CenterPoint	See all
NYU Depth v2	SGPN-CNN	See all
nuScenes-F	RRPN + R101 - F	See all
nuScenes-FB	RRPN + R101 - FB	See all
KITTI Pedestrian Hard	PiFeNet	See all
KITTI Cyclists Moderate val	Deformable PV-RCNN	See all
KITTI Pedestrians Moderate val	Deformable PV-RCNN	See all
Dense Fog	PV-RCNN	See all
KITTI Pedestrian Moderate	PiFeNet	See all
Heavy Snowfall	PV-RCNN	See all
Light Snowfall	PV-RCNN	See all
Clear Weather	PV-RCNN	See all
KITTI Pedestrian Easy	PiFeNet	See all
KITTI Pedestrian	PiFeNet	See all
V2X-SIM	Where2comm	See all
DAIR-V2X	CoBEVFlow	See all
Argoverse2	VoxelNeXt	See all
Cityscapes 3D	TaskPrompter	See all
Argoverse	ky_nctu_mo	See all
IRV2V	CoBEVFlow	See all

Show all 55 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find 3D Object Detection models and implementations

open-mmlab/mmdetection3d

14 papers

4,808

PaddlePaddle/Paddle3D

6 papers

534

open-mmlab/OpenPCDet

5 papers

4,320

DerrickXuNu/OpenCOOD

5 papers

593

See all 11 libraries.

Datasets

Subtasks

Robust 3D Object Detection

Robust BEV Detection

Latest papers

Most implemented Social Latest No code

Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting

faceonlive/ai-research • 10 Apr 2024

Finally, for MC3D-Det joint training, the elaborate dataset merge strategy is designed to solve the problem of inconsistent camera numbers and camera parameters.

152

10 Apr 2024

Paper
Code

Better Monocular 3D Detectors with LiDAR from the Past

faceonlive/ai-research • 8 Apr 2024

Accurate 3D object detection is crucial to autonomous driving.

152

08 Apr 2024

Paper
Code

MonoCD: Monocular 3D Object Detection with Complementary Depths

elvintanhust/monocd • • 4 Apr 2024

Monocular 3D object detection has attracted widespread attention due to its potential to accurately obtain object 3D localization from a single image at a low cost.

04 Apr 2024

Paper
Code

HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras

vdigpku/henet • 3 Apr 2024

Three-dimensional perception from multi-view cameras is a crucial component in autonomous driving systems, which involves multiple tasks like 3D object detection and bird's-eye-view (BEV) semantic segmentation.

03 Apr 2024

Paper
Code

VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection

skmhrk1209/VSRD • • 2 Apr 2024

In the auto-labeling stage, we represent the surface of each instance as a signed distance field (SDF) and render its silhouette as an instance mask through our proposed instance-aware volumetric silhouette rendering.

02 Apr 2024

Paper
Code

Weak-to-Strong 3D Object Detection with X-Ray Distillation

sakharok13/x-ray-teacher-patching-tools • 31 Mar 2024

This paper addresses the critical challenges of sparsity and occlusion in LiDAR-based 3D object detection.

31 Mar 2024

Paper
Code

SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects

abhi1kumar/seabird • • 29 Mar 2024

We argue that the cause of failure is the sensitivity of depth regression losses to noise of larger objects.

29 Mar 2024

Paper
Code

UADA3D: Unsupervised Adversarial Domain Adaptation for 3D Object Detection with Sparse LiDAR and Large Domain Gaps

maxiuw/uada3d • 26 Mar 2024

In this study, we address a gap in existing unsupervised domain adaptation approaches on LiDAR-based 3D object detection, which have predominantly concentrated on adapting between established, high-density autonomous driving datasets.

26 Mar 2024

Paper
Code

RCBEVDet: Radar-camera Fusion in Bird's Eye View for 3D Object Detection

vdigpku/rcbevdet • 25 Mar 2024

In the dual-stream radar backbone, a point-based encoder and a transformer-based encoder are proposed to extract radar features, with an injection and extraction module to facilitate communication between the two encoders.

25 Mar 2024

Paper
Code

Optimizing LiDAR Placements for Robust Driving Perception in Adverse Conditions

ywyeli/place3d • • 25 Mar 2024

The robustness of driving perception systems under unprecedented conditions is crucial for safety-critical usages.

25 Mar 2024

Paper
Code

3D Object Detection

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result