3D Object Detection
585 papers with code • 55 benchmarks • 48 datasets
3D Object Detection is a task in computer vision where the goal is to identify and locate objects in a 3D environment based on their shape, location, and orientation. It involves detecting the presence of objects and determining their location in the 3D space in real-time. This task is crucial for applications such as autonomous vehicles, robotics, and augmented reality.
( Image credit: AVOD )
Libraries
Use these libraries to find 3D Object Detection models and implementationsSubtasks
Latest papers
IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection
HSF applies Point-to-Grid and Grid-to-Region transformers to capture the multimodal scene context at different granularities.
RCooper: A Real-world Large-scale Dataset for Roadside Cooperative Perception
The value of roadside perception, which could extend the boundaries of autonomous driving and traffic management, has gradually become more prominent and acknowledged in recent years.
MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning
Learning robust and scalable visual representations from massive multi-view video data remains a challenge in computer vision and autonomous driving.
Unleashing HyDRa: Hybrid Fusion, Depth Consistency and Radar for Unified 3D Perception
Low-cost, vision-centric 3D perception systems for autonomous driving have made significant progress in recent years, narrowing the gap to expensive LiDAR-based methods.
3D Semantic Segmentation-Driven Representations for 3D Object Detection
In autonomous driving, 3D detection provides more precise information to downstream tasks, including path planning and motion estimation, compared to 2D detection.
Fine-Grained Pillar Feature Encoding Via Spatio-Temporal Virtual Grid for 3D Object Detection
Through STV grids, points within each pillar are individually encoded using Vertical PFE (V-PFE), Temporal PFE (T-PFE), and Horizontal PFE (H-PFE).
SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection
LiDAR-based 3D object detection plays an essential role in autonomous driving.
CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view Images
This paper introduces CN-RMA, a novel approach for 3D indoor object detection from multi-view images.
Scalable Vision-Based 3D Object Detection and Monocular Depth Estimation for Autonomous Driving
Collectively, these contributions lay a robust foundation for the widespread adoption of vision-based 3D perception technologies in autonomous driving applications.
Leveraging Anchor-based LiDAR 3D Object Detection via Point Assisted Sample Selection
3D object detection based on LiDAR point cloud and prior anchor boxes is a critical technology for autonomous driving environment perception and understanding.