TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
3D Object Detection From Monocular Images	KITTI-360	MonoDLE	AP50	0.85	# 7
3D Object Detection From Monocular Images	KITTI-360	MonoDLE	AP25	28.99	# 4
Monocular 3D Object Detection	KITTI Cars Moderate	MonoDLE	AP Medium	12.26	# 18
3D Object Detection	Rope3D	MonoDLE+(G)	AP@0.7	13.58	# 8

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/delving-into-localization-errors-for/3d-object-detection-from-monocular-images-on-7)](https://paperswithcode.com/sota/3d-object-detection-from-monocular-images-on-7?p=delving-into-localization-errors-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/delving-into-localization-errors-for/3d-object-detection-on-rope3d)](https://paperswithcode.com/sota/3d-object-detection-on-rope3d?p=delving-into-localization-errors-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/delving-into-localization-errors-for/monocular-3d-object-detection-on-kitti-cars)](https://paperswithcode.com/sota/monocular-3d-object-detection-on-kitti-cars?p=delving-into-localization-errors-for)`

Delving into Localization Errors for Monocular 3D Object Detection

CVPR 2021 · Xinzhu Ma, Yinmin Zhang, Dan Xu, Dongzhan Zhou, Shuai Yi, Haojie Li, Wanli Ouyang ·

Estimating 3D bounding boxes from monocular images is an essential component in autonomous driving, while accurate 3D object detection from this kind of data is very challenging. In this work, by intensive diagnosis experiments, we quantify the impact introduced by each sub-task and found the `localization error' is the vital factor in restricting monocular 3D detection. Besides, we also investigate the underlying reasons behind localization errors, analyze the issues they might bring, and propose three strategies. First, we revisit the misalignment between the center of the 2D bounding box and the projected center of the 3D object, which is a vital factor leading to low localization accuracy. Second, we observe that accurately localizing distant objects with existing technologies is almost impossible, while those samples will mislead the learned network. To this end, we propose to remove such samples from the training set for improving the overall performance of the detector. Lastly, we also propose a novel 3D IoU oriented loss for the size estimation of the object, which is not affected by `localization error'. We conduct extensive experiments on the KITTI dataset, where the proposed method achieves real-time detection and outperforms previous methods by a large margin. The code will be made available at: https://github.com/xinzhuma/monodle.

PDF Abstract CVPR 2021 PDF CVPR 2021 Abstract

Code

Add Remove Mark official

xinzhuma/monodle official

153

Tasks

Add Remove

3D Object Detection

3D Object Detection From Monocular Images

Autonomous Driving

Monocular 3D Object Detection

Object

object-detection

Object Detection

Datasets

KITTI

KITTI-360 Rope3D

Results from the Paper

Edit

Ranked #7 on 3D Object Detection From Monocular Images on KITTI-360

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
3D Object Detection From Monocular Images	KITTI-360	MonoDLE	AP50	0.85	# 7	Compare
3D Object Detection From Monocular Images	KITTI-360	MonoDLE	AP25	28.99	# 4	Compare
Monocular 3D Object Detection	KITTI Cars Moderate	MonoDLE	AP Medium	12.26	# 18	Compare
3D Object Detection	Rope3D	MonoDLE+(G)	AP@0.7	13.58	# 8	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Delving into Localization Errors for Monocular 3D Object Detection

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove