3D Object Detection
585 papers with code • 55 benchmarks • 48 datasets
3D Object Detection is a task in computer vision where the goal is to identify and locate objects in a 3D environment based on their shape, location, and orientation. It involves detecting the presence of objects and determining their location in the 3D space in real-time. This task is crucial for applications such as autonomous vehicles, robotics, and augmented reality.
( Image credit: AVOD )
Libraries
Use these libraries to find 3D Object Detection models and implementationsSubtasks
Latest papers with no code
SSF3D: Strict Semi-Supervised 3D Object Detection with Switching Filter
The experiments are conducted to analyze the effectiveness of above approaches and their impact on the overall performance of the system.
Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection
Additionally, we present a DepthGradient Projection (DGP) module to mitigate optimization conflicts caused by noisy depth supervision of pseudo-labels, effectively decoupling the depth gradient and removing conflicting gradients.
Impact of Video Compression Artifacts on Fisheye Camera Visual Perception Tasks
It is essential to prove that lossy video compression artifacts do not impact the performance of the perception algorithms.
CR3DT: Camera-RADAR Fusion for 3D Detection and Tracking
Accurate detection and tracking of surrounding objects is essential to enable self-driving vehicles.
Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection
Training high-accuracy 3D detectors necessitates massive labeled 3D annotations with 7 degree-of-freedom, which is laborious and time-consuming.
3D Object Detection from Point Cloud via Voting Step Diffusion
In this work, we focus on the distributional properties of point clouds and formulate the voting process as generating new points in the high-density region of the distribution of object centers.
Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments
In this work, we tackle the limitations of current LiDAR-based 3D object detection systems, which are hindered by a restricted class vocabulary and the high costs associated with annotating new object classes.
SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model
We introduce SceneScript, a method that directly produces full scene models as a sequence of structured language commands using an autoregressive, token-based approach.
Just Add $100 More: Augmenting NeRF-based Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem
Typical LiDAR-based 3D object detection models are trained in a supervised manner with real-world data collection, which is often imbalanced over classes (or long-tailed).
GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection
Additionally, we propose a Global Align module to rectify the misalignment between LiDAR and camera BEV features.