TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Domain Adaptation	Cityscapes to ACDC	CMFormer	mIoU	60.1	# 8
Source-Free Domain Adaptation	Cityscapes to ACDC	CMFormer	mIoU	60.1	# 2
Domain Generalization	GTA5-to-Cityscapes	CMFormer	mIoU	55.31	# 3
Domain Generalization	GTA-to-Avg(Cityscapes,BDD,Mapillary)	CMFormer	mIoU	51.10	# 8
Synthetic-to-Real Translation	GTAV-to-Cityscapes Labels	CMFormer	mIoU	59.7	# 18
Semantic Segmentation	GTAV-to-Cityscapes Labels	CMFormer	mIoU	55.3	# 11
Synthetic-to-Real Translation	SYNTHIA-to-Cityscapes Labels	CMFormer	mIOU	44.6	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-content-enhanced-mask-transformer/source-free-domain-adaptation-on-cityscapes)](https://paperswithcode.com/sota/source-free-domain-adaptation-on-cityscapes?p=learning-content-enhanced-mask-transformer)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-content-enhanced-mask-transformer/synthetic-to-real-translation-on-synthia-to-2)](https://paperswithcode.com/sota/synthetic-to-real-translation-on-synthia-to-2?p=learning-content-enhanced-mask-transformer)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-content-enhanced-mask-transformer/domain-generalization-on-gta5-to-cityscapes)](https://paperswithcode.com/sota/domain-generalization-on-gta5-to-cityscapes?p=learning-content-enhanced-mask-transformer)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-content-enhanced-mask-transformer/domain-adaptation-on-cityscapes-to-acdc)](https://paperswithcode.com/sota/domain-adaptation-on-cityscapes-to-acdc?p=learning-content-enhanced-mask-transformer)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-content-enhanced-mask-transformer/domain-generalization-on-gta-to-avg)](https://paperswithcode.com/sota/domain-generalization-on-gta-to-avg?p=learning-content-enhanced-mask-transformer)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-content-enhanced-mask-transformer/semantic-segmentation-on-gtav-to-cityscapes-1)](https://paperswithcode.com/sota/semantic-segmentation-on-gtav-to-cityscapes-1?p=learning-content-enhanced-mask-transformer)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-content-enhanced-mask-transformer/synthetic-to-real-translation-on-gtav-to)](https://paperswithcode.com/sota/synthetic-to-real-translation-on-gtav-to?p=learning-content-enhanced-mask-transformer)`

Learning Content-enhanced Mask Transformer for Domain Generalized Urban-Scene Segmentation

1 Jul 2023 · Qi Bi, ShaoDi You, Theo Gevers ·

Domain-generalized urban-scene semantic segmentation (USSS) aims to learn generalized semantic predictions across diverse urban-scene styles. Unlike domain gap challenges, USSS is unique in that the semantic categories are often similar in different urban scenes, while the styles can vary significantly due to changes in urban landscapes, weather conditions, lighting, and other factors. Existing approaches typically rely on convolutional neural networks (CNNs) to learn the content of urban scenes. In this paper, we propose a Content-enhanced Mask TransFormer (CMFormer) for domain-generalized USSS. The main idea is to enhance the focus of the fundamental component, the mask attention mechanism, in Transformer segmentation models on content information. To achieve this, we introduce a novel content-enhanced mask attention mechanism. It learns mask queries from both the image feature and its down-sampled counterpart, as lower-resolution image features usually contain more robust content information and are less sensitive to style variations. These features are fused into a Transformer decoder and integrated into a multi-resolution content-enhanced mask attention learning scheme. Extensive experiments conducted on various domain-generalized urban-scene segmentation datasets demonstrate that the proposed CMFormer significantly outperforms existing CNN-based methods for domain-generalized semantic segmentation, achieving improvements of up to 14.00\% in terms of mIoU (mean intersection over union). The source code is publicly available at \url{https://github.com/BiQiWHU/CMFormer}.

PDF Abstract

Code

Add Remove Mark official

BiQiWHU/CMFormer official

Tasks

Add Remove

Domain Adaptation

Domain Generalization

Scene Segmentation

Segmentation

Semantic Segmentation

Source-Free Domain Adaptation

Synthetic-to-Real Translation

Datasets

Cityscapes

SYNTHIA

GTA5

BDD100K

Mapillary Vistas Dataset ACDC (Adverse Conditions Dataset with Correspondences)

Results from the Paper

Edit

Ranked #2 on Synthetic-to-Real Translation on SYNTHIA-to-Cityscapes Labels

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Domain Adaptation	Cityscapes to ACDC	CMFormer	mIoU	60.1	# 8	Compare
Source-Free Domain Adaptation	Cityscapes to ACDC	CMFormer	mIoU	60.1	# 2	Compare
Domain Generalization	GTA5-to-Cityscapes	CMFormer	mIoU	55.31	# 3	Compare
Domain Generalization	GTA-to-Avg(Cityscapes,BDD,Mapillary)	CMFormer	mIoU	51.10	# 8	Compare
Synthetic-to-Real Translation	GTAV-to-Cityscapes Labels	CMFormer	mIoU	59.7	# 18	Compare
Semantic Segmentation	GTAV-to-Cityscapes Labels	CMFormer	mIoU	55.3	# 11	Compare
Synthetic-to-Real Translation	SYNTHIA-to-Cityscapes Labels	CMFormer	mIOU	44.6	# 2	Compare

Methods

Add Remove

Absolute Position Encodings • Adam • BPE • Dense Connections • Dropout • Focus • Label Smoothing • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer

Edit Social Preview

Learning Content-enhanced Mask Transformer for Domain Generalized Urban-Scene Segmentation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove