3D Object Detection

585 papers with code • 55 benchmarks • 48 datasets

3D Object Detection is a task in computer vision where the goal is to identify and locate objects in a 3D environment based on their shape, location, and orientation. It involves detecting the presence of objects and determining their location in the 3D space in real-time. This task is crucial for applications such as autonomous vehicles, robotics, and augmented reality.

( Image credit: AVOD )

Benchmarks

Add a Result

These leaderboards are used to track progress in 3D Object Detection

Dataset	Best Model	Compare
KITTI Cars Moderate	GLENet-VR	See all
nuScenes	EA-LSS	See all
SUN-RGBD val	Point-GCC+TR3D+FF	See all
ScanNetV2	UDeerLvic	See all
KITTI Cars Easy	GLENet-VR	See all
KITTI Cars Hard	3D Dual-Fusion	See all
nuScenes Camera Only	Far3D	See all
KITTI Pedestrians Moderate	3D-FCT	See all
KITTI Cyclists Easy	3D-FCT	See all
KITTI Cyclists Moderate	3D-FCT	See all
KITTI Cyclists Hard	SA-Det3D	See all
KITTI Cars Easy val	SA-SSD+EBM	See all
KITTI Cars Moderate val	SA-SSD+EBM	See all
KITTI Cars Hard val	M3DeTR	See all
KITTI Pedestrians Easy	IPOD	See all
KITTI Pedestrians Hard	SVGA-Net	See all
DAIR-V2X-I	MonoUNI	See all
nuscenes Camera-Radar	HyDRa	See all
waymo vehicle	PillarNeXt	See all
Rope3D	MonoUNI	See all
SUN-RGBD	CAGroup3D (Geo Only)	See all
waymo cyclist	DSVT(val)	See all
waymo pedestrian	DSVT(val)	See all
S3DIS	Point-GCC+TR3D	See all
V2XSet	V2X-ViT	See all
nuScenes LiDAR only	DSVT	See all
OPV2V	V2VNet (PointPillar backbone)	See all
KITTI Pedestrian Easy val	PVCNN	See all
KITTI Pedestrian Moderate val	PVCNN	See all
KITTI Pedestrian Hard val	PVCNN	See all
KITTI Cyclist Easy val	PVCNN	See all
KITTI Cyclist Moderate val	PVCNN	See all
KITTI Cyclist Hard val	F-PointNet++ [Qi:2018fd]	See all
aiMotive Dataset	Lidar-Radar-Camera	See all
3D Object Detection on Argoverse2 Camera Only	Far3D	See all
waymo all_ns	CenterPoint	See all
NYU Depth v2	SGPN-CNN	See all
nuScenes-F	RRPN + R101 - F	See all
nuScenes-FB	RRPN + R101 - FB	See all
KITTI Pedestrian Hard	PiFeNet	See all
KITTI Cyclists Moderate val	Deformable PV-RCNN	See all
KITTI Pedestrians Moderate val	Deformable PV-RCNN	See all
Dense Fog	PV-RCNN	See all
KITTI Pedestrian Moderate	PiFeNet	See all
Heavy Snowfall	PV-RCNN	See all
Light Snowfall	PV-RCNN	See all
Clear Weather	PV-RCNN	See all
KITTI Pedestrian Easy	PiFeNet	See all
KITTI Pedestrian	PiFeNet	See all
V2X-SIM	Where2comm	See all
DAIR-V2X	CoBEVFlow	See all
Argoverse2	VoxelNeXt	See all
Cityscapes 3D	TaskPrompter	See all
Argoverse	ky_nctu_mo	See all
IRV2V	CoBEVFlow	See all

Show all 55 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find 3D Object Detection models and implementations

open-mmlab/mmdetection3d

14 papers

4,813

PaddlePaddle/Paddle3D

6 papers

537

open-mmlab/OpenPCDet

5 papers

4,325

DerrickXuNu/OpenCOOD

5 papers

594

See all 11 libraries.

Datasets

Subtasks

Robust 3D Object Detection

Robust BEV Detection

Latest papers with no code

Most implemented Social Latest No code

SSF3D: Strict Semi-Supervised 3D Object Detection with Switching Filter

no code yet • 26 Mar 2024

The experiments are conducted to analyze the effectiveness of above approaches and their impact on the overall performance of the system.

Paper
Add Code

Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection

no code yet • 26 Mar 2024

Additionally, we present a DepthGradient Projection (DGP) module to mitigate optimization conflicts caused by noisy depth supervision of pseudo-labels, effectively decoupling the depth gradient and removing conflicting gradients.

Paper
Add Code

Impact of Video Compression Artifacts on Fisheye Camera Visual Perception Tasks

no code yet • 25 Mar 2024

It is essential to prove that lossy video compression artifacts do not impact the performance of the perception algorithms.

Paper
Add Code

CR3DT: Camera-RADAR Fusion for 3D Detection and Tracking

no code yet • 22 Mar 2024

Accurate detection and tracking of surrounding objects is essential to enable self-driving vehicles.

Paper
Add Code

Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection

no code yet • 22 Mar 2024

Training high-accuracy 3D detectors necessitates massive labeled 3D annotations with 7 degree-of-freedom, which is laborious and time-consuming.

Paper
Add Code

3D Object Detection from Point Cloud via Voting Step Diffusion

no code yet • 21 Mar 2024

In this work, we focus on the distributional properties of point clouds and formulate the voting process as generating new points in the high-density region of the distribution of object centers.

Paper
Add Code

Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments

no code yet • 20 Mar 2024

In this work, we tackle the limitations of current LiDAR-based 3D object detection systems, which are hindered by a restricted class vocabulary and the high costs associated with annotating new object classes.

Paper
Add Code

SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model

no code yet • 19 Mar 2024

We introduce SceneScript, a method that directly produces full scene models as a sequence of structured language commands using an autoregressive, token-based approach.

Paper
Add Code

Just Add $100 More: Augmenting NeRF-based Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem

no code yet • 18 Mar 2024

Typical LiDAR-based 3D object detection models are trained in a supervised manner with real-world data collection, which is often imbalanced over classes (or long-tailed).

Paper
Add Code

GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection

no code yet • 18 Mar 2024

Additionally, we propose a Global Align module to rectify the misalignment between LiDAR and camera BEV features.

Paper
Add Code

3D Object Detection

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result