TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
3D Object Detection From Monocular Images	KITTI-360	GUPNet	AP50	0.87	# 6
3D Object Detection From Monocular Images	KITTI-360	GUPNet	AP25	27.25	# 5
3D Object Detection From Monocular Images	Waymo Open Dataset	GUP Net	3D mAPH Vehicle (Front Camera Only)	2.14	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/geometry-uncertainty-projection-network-for/3d-object-detection-from-monocular-images-on-6)](https://paperswithcode.com/sota/3d-object-detection-from-monocular-images-on-6?p=geometry-uncertainty-projection-network-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/geometry-uncertainty-projection-network-for/3d-object-detection-from-monocular-images-on-7)](https://paperswithcode.com/sota/3d-object-detection-from-monocular-images-on-7?p=geometry-uncertainty-projection-network-for)`

Geometry Uncertainty Projection Network for Monocular 3D Object Detection

ICCV 2021 · Yan Lu, Xinzhu Ma, Lei Yang, Tianzhu Zhang, Yating Liu, Qi Chu, Junjie Yan, Wanli Ouyang ·

Geometry Projection is a powerful depth estimation method in monocular 3D object detection. It estimates depth dependent on heights, which introduces mathematical priors into the deep model. But projection process also introduces the error amplification problem, in which the error of the estimated height will be amplified and reflected greatly at the output depth. This property leads to uncontrollable depth inferences and also damages the training efficiency. In this paper, we propose a Geometry Uncertainty Projection Network (GUP Net) to tackle the error amplification problem at both inference and training stages. Specifically, a GUP module is proposed to obtains the geometry-guided uncertainty of the inferred depth, which not only provides high reliable confidence for each depth but also benefits depth learning. Furthermore, at the training stage, we propose a Hierarchical Task Learning strategy to reduce the instability caused by error amplification. This learning algorithm monitors the learning situation of each task by a proposed indicator and adaptively assigns the proper loss weights for different tasks according to their pre-tasks situation. Based on that, each task starts learning only when its pre-tasks are learned well, which can significantly improve the stability and efficiency of the training process. Extensive experiments demonstrate the effectiveness of the proposed method. The overall model can infer more reliable object depth than existing methods and outperforms the state-of-the-art image-based monocular 3D detectors by 3.74% and 4.7% AP40 of the car and pedestrian categories on the KITTI benchmark.

PDF Abstract ICCV 2021 PDF ICCV 2021 Abstract

Code

Add Remove Mark official

supermhp/gupnet official

126

Tasks

Add Remove

3D Object Detection

3D Object Detection From Monocular Images

Depth Estimation

Monocular 3D Object Detection

Object

object-detection

Object Detection

Datasets

KITTI

Waymo Open Dataset

KITTI-360

Results from the Paper

Edit

Ranked #2 on 3D Object Detection From Monocular Images on Waymo Open Dataset

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
3D Object Detection From Monocular Images	KITTI-360	GUPNet	AP50	0.87	# 6	Compare
3D Object Detection From Monocular Images	KITTI-360	GUPNet	AP25	27.25	# 5	Compare
3D Object Detection From Monocular Images	Waymo Open Dataset	GUP Net	3D mAPH Vehicle (Front Camera Only)	2.14	# 2	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Geometry Uncertainty Projection Network for Monocular 3D Object Detection

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove