Cross-view Transformers for real-time Map-view Semantic Segmentation

CVPR 2022  ·  Brady Zhou, Philipp Krähenbühl

We present cross-view transformers, an efficient attention-based model for map-view semantic segmentation from multiple cameras. Our architecture implicitly learns a mapping from individual camera views into a canonical map-view representation using a camera-aware cross-view attention mechanism. Each camera uses positional embeddings that depend on its intrinsic and extrinsic calibration. These embeddings allow a transformer to learn the mapping across different views without ever explicitly modeling it geometrically. The architecture consists of a convolutional image encoder for each view and cross-view transformer layers to infer a map-view semantic segmentation. Our model is simple, easily parallelizable, and runs in real-time. The presented architecture performs at state-of-the-art on the nuScenes dataset, with 4x faster inference speeds. Code is available at https://github.com/bradyz/cross_view_transformers.
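
The following is a minimal PyTorch-style sketch of the camera-aware cross-view attention described in the abstract: per-pixel ray directions derived from each camera's intrinsics and extrinsics are embedded and added to the image features, and a learned grid of map-view queries attends over all cameras at once. The class name `CrossViewAttention`, the helper `pixel_ray_directions`, the single attention layer, and all hyperparameters are illustrative assumptions, not the authors' exact implementation (see the linked repository for that).

```python
# Minimal sketch of camera-aware cross-view attention (illustrative, not the official code).
import torch
import torch.nn as nn


def pixel_ray_directions(K, E, H, W):
    """Unit ray directions for an H x W feature map.
    K: (3, 3) camera intrinsics, E: (4, 4) camera-to-world extrinsics (assumed convention)."""
    u, v = torch.meshgrid(torch.arange(W), torch.arange(H), indexing="xy")
    pix = torch.stack([u, v, torch.ones_like(u)], dim=-1).float()   # (H, W, 3) homogeneous pixels
    dirs = pix.reshape(-1, 3) @ torch.inverse(K).T                   # rays in the camera frame
    dirs = dirs @ E[:3, :3].T                                        # rotate into the shared frame
    return dirs / dirs.norm(dim=-1, keepdim=True)                    # (H*W, 3)


class CrossViewAttention(nn.Module):
    def __init__(self, dim=128, heads=4, map_size=25):
        super().__init__()
        # Learned map-view query grid: one embedding per BEV cell.
        self.map_query = nn.Parameter(torch.randn(map_size * map_size, dim))
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        # Projects per-camera geometry (ray directions) into a positional embedding.
        self.cam_embed = nn.Linear(3, dim)

    def forward(self, img_feats, cam_dirs):
        # img_feats: (B, N_cams, H*W, dim) features from a shared CNN image encoder
        # cam_dirs:  (B, N_cams, H*W, 3) ray directions from intrinsics/extrinsics
        B, N, HW, D = img_feats.shape
        keys = img_feats + self.cam_embed(cam_dirs)          # camera-aware keys
        keys = keys.reshape(B, N * HW, D)                    # flatten over all cameras
        values = img_feats.reshape(B, N * HW, D)
        queries = self.map_query.unsqueeze(0).expand(B, -1, -1)
        bev, _ = self.attn(queries, keys, values)            # map-view features
        return bev                                           # (B, map_size**2, dim)


# Shape-only usage example with random inputs (6 cameras, 28x60 feature maps):
feats = torch.randn(1, 6, 28 * 60, 128)
dirs = torch.randn(1, 6, 28 * 60, 3)
bev = CrossViewAttention()(feats, dirs)                      # (1, 625, 128)
```

In this simplified form, geometry enters only through the ray-direction embeddings, so the attention layer can learn the view-to-map correspondence without an explicit projection step.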

Results

Task: Bird's-Eye View Semantic Segmentation
Dataset: nuScenes
Model: CVT

Metric | Value | Global Rank
IoU veh - 224x480 - No vis filter - 100x100 at 0.5 | 31.4 | #9
IoU veh - 448x800 - No vis filter - 100x100 at 0.5 | 32.5 | #6
IoU veh - 224x480 - Vis filter - 100x100 at 0.5 | 36.0 | #8
IoU veh - 448x800 - Vis filter - 100x100 at 0.5 | 37.7 | #6

Methods

No methods listed for this paper.