Monocular 3D Object Detection

77 papers with code • 15 benchmarks • 5 datasets

Monocular 3D Object Detection is the task to draw 3D bounding box around objects in a single 2D RGB image. It is localization task but without any extra information like depth or other sensors or multiple-images.

Benchmarks

Add a Result

These leaderboards are used to track progress in Monocular 3D Object Detection

Dataset	Best Model	Compare
KITTI Cars Moderate	CIE	See all
SUN RGB-D	IM3D	See all
KITTI Cars Hard	CIE	See all
KITTI Pedestrian Hard	DD3D	See all
KITTI Cars Easy	CIE	See all
KITTI Pedestrian Easy	CMKD	See all
KITTI Pedestrian Moderate	CMKD	See all
Google Objectron	Lin2021	See all
KITTI Cyclist Easy	CMKD	See all
KITTI Cyclist Moderate	CMKD	See all
KITTI Cyclist Hard	CMKD	See all
KITTI Pedestrians Moderate val	CubifAE-3D	See all
Virtual KITTI 2	CubifAE-3D	See all
CoPerception-UAVs	Where2comm	See all
OPV2V	Where2comm	See all

Show all 15 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Monocular 3D Object Detection models and implementations

open-mmlab/mmdetection3d

3 papers

4,823

PaddlePaddle/Paddle3D

2 papers

541

Owen-Liuyuxuan/visualDet3D

2 papers

358

Datasets

Latest papers

Most implemented Social Latest No code

Delving into Motion-Aware Matching for Monocular 3D Object Tracking

kuanchihhuang/moma-m3t • • ICCV 2023

In this paper, we find that the motion cue of objects along different time frames is critical in 3D multi-object tracking, which is less explored in existing monocular-based approaches.

22 Aug 2023

Paper
Code

MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection

cskkxjk/mononerd • • ICCV 2023

To the best of our knowledge, this work is the first to introduce volume rendering for M3D, and demonstrates the potential of implicit reconstruction for image-based 3D perception.

18 Aug 2023

Paper
Code

VCVW-3D: A Virtual Construction Vehicles and Workers Dataset with 3D Annotations

dyxm/vcvw-3d • 29 May 2023

Currently, object detection applications in construction are almost based on pure 2D data (both image and annotation are 2D-based), resulting in the developed artificial intelligence (AI) applications only applicable to some scenarios that only require 2D information.

29 May 2023

Paper
Code

Learning Occupancy for Monocular 3D Object Detection

spengliang/occupancym3d • • 25 May 2023

Monocular 3D detection is a challenging task due to the lack of accurate 3D information.

25 May 2023

Paper
Code

SSD-MonoDETR: Supervised Scale-aware Deformable Transformer for Monocular 3D Object Detection

mikasa3lili/ssd-monodetr • • 12 May 2023

To tackle this problem, this paper proposes a novel "Supervised Scale-aware Deformable Attention" (SSDA) for monocular 3D object detection.

12 May 2023

Paper
Code

SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer

mit-han-lab/sparsevit • • CVPR 2023

High-resolution images enable neural networks to learn richer visual representations.

30 Mar 2023

Paper
Code

3DPPE: 3D Point Positional Encoding for Transformer-based Multi-Camera 3D Object Detection

drilistbox/3dppe • • ICCV 2023

Although 3D measurements are not available at the inference time of monocular 3D object detection, 3DPPE uses predicted depth to approximate the real point positions.

01 Jan 2023

Paper
Code

OBMO: One Bounding Box Multiple Objects for Monocular 3D Object Detection

mrsempress/obmo • 20 Dec 2022

Unfortunately, the network cannot accurately distinguish different depths from such non-discriminative visual features, resulting in unstable depth training.

20 Dec 2022

Paper
Code

3DPPE: 3D Point Positional Encoding for Multi-Camera 3D Object Detection Transformers

drilistbox/3dppe • • 27 Nov 2022

Although 3D measurements are not available at the inference time of monocular 3D object detection, 3DPPE uses predicted depth to approximate the real point positions.

27 Nov 2022

Paper
Code

Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection

Cc-Hy/CMKD • • 14 Nov 2022

Leveraging LiDAR-based detectors or real LiDAR point data to guide monocular 3D detection has brought significant improvement, e. g., Pseudo-LiDAR methods.

102

14 Nov 2022

Paper
Code

Monocular 3D Object Detection

Benchmarks Add a Result

Libraries

Datasets

Latest papers

Content

Benchmarks

Add a Result