4D-Former: Multimodal 4D Panoptic Segmentation

2 Nov 2023 · Ali Athar, Enxu Li, Sergio Casas, Raquel Urtasun

4D panoptic segmentation is a challenging but practically useful task that requires every point in a LiDAR point-cloud sequence to be assigned a semantic class label, and individual objects to be segmented and tracked over time. Existing approaches utilize only LiDAR input, which conveys limited information in regions where points are sparse. This problem can, however, be mitigated by utilizing RGB camera images, which offer appearance-based information that can reinforce the geometry-based LiDAR features. Motivated by this, we propose 4D-Former: a novel method for 4D panoptic segmentation that leverages both LiDAR and image modalities, and predicts semantic masks as well as temporally consistent object masks for the input point-cloud sequence. We encode semantic classes and objects using a set of concise queries which absorb feature information from both data modalities. Additionally, we propose a learned mechanism for associating object tracks over time that reasons over both appearance and spatial location. We apply 4D-Former to the nuScenes and SemanticKITTI datasets, where it achieves state-of-the-art results.
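The abstract names two learned components: queries that absorb features from both the LiDAR and image modalities, and a track-association mechanism that reasons over appearance and spatial location. Below is a minimal PyTorch sketch of how such components could look; it is not the authors' implementation. The module names, dimensions, the order of the cross-attention steps, and the specific spatial cues fed to the association head are all illustrative assumptions.

```python
# Illustrative sketch only, assuming PyTorch; not the authors' 4D-Former code.
import torch
import torch.nn as nn

class MultimodalQueryDecoder(nn.Module):
    """Queries absorb information from both modalities via cross-attention."""
    def __init__(self, d_model=256, n_heads=8, n_queries=100):
        super().__init__()
        self.queries = nn.Embedding(n_queries, d_model)
        self.attn_lidar = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.attn_image = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.ffn = nn.Sequential(
            nn.Linear(d_model, 4 * d_model), nn.ReLU(), nn.Linear(4 * d_model, d_model)
        )
        self.norm1, self.norm2, self.norm3 = (nn.LayerNorm(d_model) for _ in range(3))

    def forward(self, lidar_feats, image_feats):
        # lidar_feats: (B, N_points, d); image_feats: (B, N_pixels, d)
        q = self.queries.weight.unsqueeze(0).expand(lidar_feats.size(0), -1, -1)
        # Queries first gather geometric context, then appearance context.
        q = self.norm1(q + self.attn_lidar(q, lidar_feats, lidar_feats)[0])
        q = self.norm2(q + self.attn_image(q, image_feats, image_feats)[0])
        q = self.norm3(q + self.ffn(q))
        return q  # one embedding per semantic-class / object query

class AssociationHead(nn.Module):
    """Scores track/detection pairs from appearance and spatial cues."""
    def __init__(self, d_model=256):
        super().__init__()
        # Input per pair: two embeddings + 3-D offset + scalar distance.
        self.mlp = nn.Sequential(
            nn.Linear(2 * d_model + 4, d_model), nn.ReLU(), nn.Linear(d_model, 1)
        )

    def forward(self, track_emb, det_emb, track_xyz, det_xyz):
        # track_emb: (T, d), det_emb: (D, d); track_xyz: (T, 3), det_xyz: (D, 3)
        T, D = track_emb.size(0), det_emb.size(0)
        pairs = torch.cat(
            [
                track_emb.unsqueeze(1).expand(T, D, -1),       # appearance (track)
                det_emb.unsqueeze(0).expand(T, D, -1),         # appearance (detection)
                track_xyz.unsqueeze(1) - det_xyz.unsqueeze(0), # relative offset
                torch.cdist(track_xyz, det_xyz).unsqueeze(-1), # Euclidean distance
            ],
            dim=-1,
        )
        return self.mlp(pairs).squeeze(-1)  # (T, D) association logits
```

In this sketch, the sequential cross-attention lets each query collect geometry-based context from the point cloud before refining it with appearance-based image features, and the association logits can be fed to a matching step (e.g., Hungarian assignment) to link object masks across frames.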

Task                Dataset                  Model      Metric  Value  Global Rank
Panoptic Tracking   Panoptic nuScenes test   4D-Former  PAT     79.4   #1
Panoptic Tracking   Panoptic nuScenes test   4D-Former  LSTQ    78.2   #1
Panoptic Tracking   Panoptic nuScenes test   4D-Former  PTQ     75.5   #1
Panoptic Tracking   Panoptic nuScenes val    4D-Former  PAT     78.3   #1
Panoptic Tracking   Panoptic nuScenes val    4D-Former  LSTQ    76.4   #1
Panoptic Tracking   Panoptic nuScenes val    4D-Former  PTQ     75.2   #1
