Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Bird's-Eye View Semantic Segmentation	nuScenes	TBP-Former	IoU ped - 224x480 - Vis filter. - 100x100 at 0.5	18.6	#2
Bird's-Eye View Semantic Segmentation	nuScenes	TBP-Former (static)	IoU ped - 224x480 - Vis filter. - 100x100 at 0.5	17.2	#4

TBP-Former: Learning Temporal Bird's-Eye-View Pyramid for Joint Perception and Prediction in Vision-Centric Autonomous Driving

CVPR 2023 · Shaoheng Fang, Zi Wang, Yiqi Zhong, Junhao Ge, Siheng Chen, Yanfeng Wang

Vision-centric joint perception and prediction (PnP) has become an emerging trend in autonomous driving research. It predicts the future states of the traffic participants in the surrounding environment from raw RGB images. However, two challenges remain critical: synchronizing features obtained from multiple camera views and timestamps, which is difficult because of inevitable geometric distortions, and further exploiting those spatial-temporal features. To address this issue, we propose a temporal bird's-eye-view pyramid transformer (TBP-Former) for vision-centric PnP, which includes two novel designs. First, a pose-synchronized BEV encoder maps raw image inputs with any camera pose at any time to a shared and synchronized BEV space for better spatial-temporal synchronization. Second, a spatial-temporal pyramid transformer comprehensively extracts multi-scale BEV features and predicts future BEV states with the support of spatial-temporal priors. Extensive experiments on the nuScenes dataset show that our proposed framework overall outperforms all state-of-the-art vision-based prediction methods.

Code

No code implementations yet.

Tasks

Autonomous Driving

Bird's-Eye View Semantic Segmentation

Datasets

nuScenes

Results from the Paper

Ranked #2 on Bird's-Eye View Semantic Segmentation on nuScenes (IoU ped - 224x480 - Vis filter. - 100x100 at 0.5 metric)
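
For readers unfamiliar with the metric family, the following is a minimal sketch of intersection-over-union (IoU) between a predicted and a ground-truth binary bird's-eye-view grid. It is not the paper's evaluation code: the function name `binary_iou`, the toy 4x4 grids, and the convention that two empty masks score 1.0 are all illustrative assumptions, and the benchmark's resolution and visibility filtering are not modeled.

```python
def binary_iou(pred, target):
    """Intersection-over-union of two same-shaped binary grids (lists of rows).

    Hypothetical helper for illustration; not taken from the TBP-Former paper.
    """
    if len(pred) != len(target) or any(
        len(p_row) != len(t_row) for p_row, t_row in zip(pred, target)
    ):
        raise ValueError("grids must have the same shape")

    intersection = 0
    union = 0
    for p_row, t_row in zip(pred, target):
        for p, t in zip(p_row, t_row):
            if p and t:
                intersection += 1  # cell occupied in both grids
            if p or t:
                union += 1  # cell occupied in at least one grid

    # Assumed convention: two empty masks count as a perfect match.
    return intersection / union if union else 1.0


# Toy 4x4 BEV grids where 1 marks a cell occupied by a pedestrian.
pred = [
    [0, 1, 1, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]
target = [
    [0, 1, 0, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]

iou = binary_iou(pred, target)  # 2 shared cells / 4 cells in the union
print(f"IoU = {iou:.2f}")
```

Benchmark tables commonly report IoU as a percentage, so a value such as 18.6 would correspond to 0.186 on the scale used in this sketch, assuming that convention applies here.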

Methods

PnP
