TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
3D Object Detection	DAIR-V2X-I	MonoUNI	AP\|R40(moderate)	87.2	# 1
3D Object Detection	DAIR-V2X-I	MonoUNI	AP\|R40(easy)	90.92	# 1
3D Object Detection	DAIR-V2X-I	MonoUNI	AP\|R40(hard)	87.2	# 1
Monocular 3D Object Detection	KITTI Cars Moderate	MonoUNI	AP Medium	16.73	# 4
3D Object Detection	Rope3D	MonoUNI	AP@0.7	75.27	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/monouni-a-unified-vehicle-and-infrastructure/3d-object-detection-on-dair-v2x-i)](https://paperswithcode.com/sota/3d-object-detection-on-dair-v2x-i?p=monouni-a-unified-vehicle-and-infrastructure)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/monouni-a-unified-vehicle-and-infrastructure/3d-object-detection-on-rope3d)](https://paperswithcode.com/sota/3d-object-detection-on-rope3d?p=monouni-a-unified-vehicle-and-infrastructure)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/monouni-a-unified-vehicle-and-infrastructure/monocular-3d-object-detection-on-kitti-cars)](https://paperswithcode.com/sota/monocular-3d-object-detection-on-kitti-cars?p=monouni-a-unified-vehicle-and-infrastructure)`

MonoUNI: A Unified Vehicle and Infrastructure-side Monocular 3D Object Detection Network with Sufficient Depth Clues

Neural Information Processing Systems 2023 · Jinrang Jia, Zhenjia Li, Yifeng Shi ·

Monocular 3D detection of vehicle and infrastructure sides are two important topics in autonomous driving. Due to diverse sensor installations and focal lengths, researchers are faced with the challenge of constructing algorithms for the two topics based on different prior knowledge. In this paper, by taking into account the diversity of pitch angles and focal lengths, we propose a unified optimization target named normalized depth, which realizes the unification of 3D detection problems for the two sides. Furthermore, to enhance the accuracy of monocular 3D detection, 3D normalized cube depth of obstacle is developed to promote the learning of depth information. We posit that the richness of depth clues is a pivotal factor impacting the detection performance on both the vehicle and infrastructure sides. A richer set of depth clues facilitates the model to learn better spatial knowledge, and the 3D normalized cube depth offers sufficient depth clues. Extensive experiments demonstrate the effectiveness of our approach. Without introducing any extra information, our method, named MonoUNI, achieves state-of-the-art performance on five widely used monocular 3D detection benchmarks, including Rope3D and DAIR-V2X-I for the infrastructure side, KITTI and Waymo for the vehicle side, and nuScenes for the cross-dataset evaluation.

PDF Abstract Neural Information 2023 PDF

Code

Add Remove Mark official

Traffic-X/MonoUNI official

Tasks

Add Remove

3D Object Detection

Autonomous Driving

Monocular 3D Object Detection

object-detection

Object Detection

Datasets

KITTI DAIR-V2X Rope3D

Results from the Paper

Add Remove

Ranked #1 on 3D Object Detection on DAIR-V2X-I

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
3D Object Detection	DAIR-V2X-I	MonoUNI	AP\|R40(moderate)	87.2	# 1	Compare
			AP\|R40(easy)	90.92	# 1	Compare
			AP\|R40(hard)	87.2	# 1	Compare
Monocular 3D Object Detection	KITTI Cars Moderate	MonoUNI	AP Medium	16.73	# 4	Compare
3D Object Detection	Rope3D	MonoUNI	AP@0.7	75.27	# 1	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

MonoUNI: A Unified Vehicle and Infrastructure-side Monocular 3D Object Detection Network with Sufficient Depth Clues

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove