Monocular 3D Object Detection

77 papers with code • 15 benchmarks • 5 datasets

Monocular 3D Object Detection is the task of drawing 3D bounding boxes around objects in a single 2D RGB image. It is a localization task that must be solved without extra information such as depth maps, additional sensors, or multiple images.
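As a rough illustration of what a 3D bounding box in a single image amounts to, the sketch below builds the eight corners of a box in camera coordinates and projects them into the image with a pinhole model. The box parameterization (center, length/height/width, yaw about the vertical axis), the axis convention (x right, y down, z forward), the function names, and the intrinsic values are all assumptions chosen for illustration; they are not taken from any of the papers listed here.

```python
import math

def box3d_corners(center, dims, yaw):
    """Return the 8 corners of a 3D box in camera coordinates.

    center: (x, y, z) of the box center.
    dims:   (length, height, width) of the box.
    yaw:    rotation about the camera's vertical (y) axis, in radians.
    """
    cx, cy, cz = center
    length, height, width = dims
    c, s = math.cos(yaw), math.sin(yaw)
    corners = []
    for dx in (length / 2, -length / 2):
        for dy in (height / 2, -height / 2):
            for dz in (width / 2, -width / 2):
                # Rotate the corner offset about the y axis, then translate.
                x = c * dx + s * dz
                z = -s * dx + c * dz
                corners.append((cx + x, cy + dy, cz + z))
    return corners

def project(points, fx, fy, u0, v0):
    """Pinhole projection of camera-frame points (z > 0) to pixel coordinates."""
    return [(fx * x / z + u0, fy * y / z + v0) for x, y, z in points]

# Hypothetical example: a car-sized box 10 m in front of the camera.
corners = box3d_corners(center=(0.0, 0.0, 10.0), dims=(4.0, 1.5, 2.0), yaw=0.0)
pixels = project(corners, fx=700.0, fy=700.0, u0=640.0, v0=360.0)
```

A monocular detector has to estimate the center, dimensions, and yaw from image evidence alone; the projection step only shows how such an estimate maps back onto the 2D image.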

Delving into Motion-Aware Matching for Monocular 3D Object Tracking

kuanchihhuang/moma-m3t ICCV 2023

In this paper, we find that the motion cue of objects along different time frames is critical in 3D multi-object tracking, which is less explored in existing monocular-based approaches.

24
22 Aug 2023

MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection

cskkxjk/mononerd ICCV 2023

To the best of our knowledge, this work is the first to introduce volume rendering for M3D, and demonstrates the potential of implicit reconstruction for image-based 3D perception.

69
18 Aug 2023

VCVW-3D: A Virtual Construction Vehicles and Workers Dataset with 3D Annotations

dyxm/vcvw-3d 29 May 2023

Currently, object detection applications in construction are almost entirely based on pure 2D data (both images and annotations are 2D), so the resulting artificial intelligence (AI) applications are limited to scenarios that require only 2D information.

1
29 May 2023

Learning Occupancy for Monocular 3D Object Detection

spengliang/occupancym3d 25 May 2023

Monocular 3D detection is a challenging task due to the lack of accurate 3D information.

78
25 May 2023

SSD-MonoDETR: Supervised Scale-aware Deformable Transformer for Monocular 3D Object Detection

mikasa3lili/ssd-monodetr 12 May 2023

To tackle this problem, this paper proposes a novel "Supervised Scale-aware Deformable Attention" (SSDA) for monocular 3D object detection.

16
12 May 2023

SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer

mit-han-lab/sparsevit CVPR 2023

High-resolution images enable neural networks to learn richer visual representations.

51
30 Mar 2023

3DPPE: 3D Point Positional Encoding for Transformer-based Multi-Camera 3D Object Detection

drilistbox/3dppe ICCV 2023

Although 3D measurements are not available at the inference time of monocular 3D object detection, 3DPPE uses predicted depth to approximate the real point positions.

49
01 Jan 2023

OBMO: One Bounding Box Multiple Objects for Monocular 3D Object Detection

mrsempress/obmo 20 Dec 2022

Unfortunately, the network cannot accurately distinguish different depths from such non-discriminative visual features, resulting in unstable depth training.

2
20 Dec 2022

3DPPE: 3D Point Positional Encoding for Multi-Camera 3D Object Detection Transformers

drilistbox/3dppe 27 Nov 2022

Although 3D measurements are not available at the inference time of monocular 3D object detection, 3DPPE uses predicted depth to approximate the real point positions.

49
27 Nov 2022

Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection

Cc-Hy/CMKD 14 Nov 2022

Leveraging LiDAR-based detectors or real LiDAR point data to guide monocular 3D detection has brought significant improvement, e.g., Pseudo-LiDAR methods.

102
14 Nov 2022