TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Semantic Segmentation	iSAID	AerialFormer-B	mIoU	69.3	# 3
Semantic Segmentation	iSAID	AerialFormer-T	mIoU	67.5	# 9
Semantic Segmentation	iSAID	AerialFormer-S	mIoU	68.4	# 5
Semantic Segmentation	ISPRS Potsdam	AerialFormer-B	Overall Accuracy	93.9	# 1
Semantic Segmentation	ISPRS Potsdam	AerialFormer-B	Mean F1	94.1	# 1
Semantic Segmentation	ISPRS Potsdam	AerialFormer-B	Mean IoU	89.1	# 1
Semantic Segmentation	LoveDA	AerialFormer-B	Category mIoU	54.1	# 4
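
The metrics named in the table above (Overall Accuracy, Mean F1, mIoU) are typically all derived from a per-class confusion matrix. The following is a minimal sketch of those standard definitions, not the evaluation code used for the benchmark. The function name `segmentation_metrics` and the choice to skip classes that never appear are assumptions made for illustration.

```python
from typing import Dict, List


def segmentation_metrics(conf: List[List[int]]) -> Dict[str, float]:
    """Compute overall accuracy, mean IoU and mean F1 from a confusion matrix.

    conf[i][j] is the number of pixels whose ground-truth class is i and
    whose predicted class is j. (Hypothetical helper, for illustration only.)
    """
    n = len(conf)
    total = sum(sum(row) for row in conf)
    ious: List[float] = []
    f1s: List[float] = []
    for c in range(n):
        tp = conf[c][c]
        fn = sum(conf[c]) - tp                        # class c pixels predicted as something else
        fp = sum(conf[r][c] for r in range(n)) - tp   # other pixels predicted as class c
        if tp + fp + fn == 0:
            continue  # assumption: a class absent from both labels and predictions is skipped
        ious.append(tp / (tp + fp + fn))
        f1s.append(2 * tp / (2 * tp + fp + fn))
    return {
        "overall_accuracy": sum(conf[c][c] for c in range(n)) / total,
        "mean_iou": sum(ious) / len(ious),
        "mean_f1": sum(f1s) / len(f1s),
    }


if __name__ == "__main__":
    # Toy two-class confusion matrix, not data from the paper.
    print(segmentation_metrics([[3, 1], [1, 5]]))
```

Benchmarks differ in which classes they average over (for example, whether a background or clutter class is included), so numbers computed this way are only comparable when the class set matches.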

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/aerialformer-multi-resolution-transformer-for/semantic-segmentation-on-isprs-potsdam)](https://paperswithcode.com/sota/semantic-segmentation-on-isprs-potsdam?p=aerialformer-multi-resolution-transformer-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/aerialformer-multi-resolution-transformer-for/semantic-segmentation-on-isaid)](https://paperswithcode.com/sota/semantic-segmentation-on-isaid?p=aerialformer-multi-resolution-transformer-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/aerialformer-multi-resolution-transformer-for/semantic-segmentation-on-loveda)](https://paperswithcode.com/sota/semantic-segmentation-on-loveda?p=aerialformer-multi-resolution-transformer-for)`

AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation

12 Jun 2023 · Kashu Yamazaki, Taisei Hanyu, Minh Tran, Adrian de Luis, Roy McCann, Haitao Liao, Chase Rainwater, Meredith Adkins, Jackson Cothren, Ngan Le

Aerial image segmentation is semantic segmentation from a top-down perspective. It poses several challenges: a strong imbalance in the foreground-background distribution, complex backgrounds, intra-class heterogeneity, inter-class homogeneity, and tiny objects. To handle these problems, we build on the strengths of Transformers and propose AerialFormer, which combines a Transformer in the contracting path with lightweight Multi-Dilated Convolutional Neural Networks (MD-CNNs) in the expanding path. AerialFormer is a hierarchical structure in which the Transformer encoder outputs multi-scale features and the MD-CNN decoder aggregates information across those scales. It thus takes both local and global context into account to produce powerful representations and high-resolution segmentation. We benchmark AerialFormer on three common datasets: iSAID, LoveDA, and Potsdam. Comprehensive experiments and extensive ablation studies show that AerialFormer outperforms previous state-of-the-art methods. Our source code will be made publicly available upon acceptance.
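
To make the idea of multi-dilated convolution more concrete, here is a toy 1D sketch in plain Python. It is not the AerialFormer decoder: the function names `dilated_conv1d` and `multi_dilated_block`, the dilation rates, and the choice to return the branches side by side are all assumptions for illustration. It only shows how parallel branches with different dilation rates see different context widths around the same position.

```python
from typing import List, Sequence


def dilated_conv1d(x: Sequence[float], kernel: Sequence[float], dilation: int) -> List[float]:
    """Same-length 1D cross-correlation with zero padding.

    Assumes an odd-length kernel. Taps are spaced `dilation` samples apart,
    so the receptive field is (len(kernel) - 1) * dilation + 1.
    """
    half = (len(kernel) // 2) * dilation
    out: List[float] = []
    for i in range(len(x)):
        acc = 0.0
        for j, w in enumerate(kernel):
            idx = i - half + j * dilation
            if 0 <= idx < len(x):  # positions outside the signal count as zero
                acc += w * x[idx]
        out.append(acc)
    return out


def multi_dilated_block(
    x: Sequence[float],
    kernel: Sequence[float],
    dilations: Sequence[int] = (1, 2, 3),  # hypothetical rates, not from the paper
) -> List[List[float]]:
    """Apply one branch per dilation rate and return the branch outputs as channels."""
    return [dilated_conv1d(x, kernel, d) for d in dilations]


if __name__ == "__main__":
    impulse = [0, 0, 0, 1, 0, 0, 0]
    for rate, branch in zip((1, 2, 3), multi_dilated_block(impulse, [1, 1, 1])):
        print(rate, branch)
```

With a 3-tap kernel, the three branches cover 3, 5, and 7 input positions respectively, which is the sense in which a small dilated kernel can mix local and wider context at low cost.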

Code

UARK-AICV/AerialFormer official

Tasks

Image Segmentation

Segmentation

Semantic Segmentation

Datasets

iSAID

LoveDA

ISPRS Potsdam

Results from the Paper

Ranked #1 on Semantic Segmentation on ISPRS Potsdam

Methods

Absolute Position Encodings • Adam • BPE • Dense Connections • Dropout • Label Smoothing • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer
