RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer

13 Oct 2022 · Jian Wang, Chenhui Gou, Qiman Wu, Haocheng Feng, Junyu Han, Errui Ding, Jingdong Wang

Recently, transformer-based networks have shown impressive results in semantic segmentation. Yet for real-time semantic segmentation, pure CNN-based approaches still dominate, owing to the time-consuming computation of transformers. We propose RTFormer, an efficient dual-resolution transformer for real-time semantic segmentation, which achieves a better trade-off between performance and efficiency than CNN-based models. To achieve high inference efficiency on GPU-like devices, RTFormer leverages GPU-Friendly Attention with linear complexity and discards the multi-head mechanism. In addition, we find that cross-resolution attention gathers global context for the high-resolution branch more efficiently by spreading the high-level knowledge learned from the low-resolution branch. Extensive experiments on mainstream benchmarks demonstrate the effectiveness of RTFormer: it achieves state-of-the-art results on Cityscapes, CamVid and COCOStuff, and shows promising results on ADE20K. Code is available at PaddleSeg: https://github.com/PaddlePaddle/PaddleSeg.
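The abstract does not spell out how the attention reaches linear complexity, so the following is only an illustrative sketch under stated assumptions, not the paper's GPU-Friendly Attention. It assumes a single-head design in which every token attends to a fixed number M of learnable key/value vectors, so the cost grows as O(N·M·d) in the token count N rather than O(N²·d). The function name `fixed_memory_attention` and all shapes are hypothetical.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def fixed_memory_attention(x, keys, values):
    """Single-head attention against a fixed-size learnable memory.

    ASSUMPTION: an illustrative stand-in for a linear-complexity attention,
    not the formulation used in RTFormer.

    x:      (N, d) token features
    keys:   (M, d) learnable keys, M independent of N
    values: (M, d) learnable values
    Returns the (N, d) output and the (N, M) attention weights.
    """
    d = x.shape[1]
    scores = x @ keys.T / np.sqrt(d)      # (N, M): linear in N
    weights = softmax(scores, axis=-1)    # each token's weights sum to 1
    return weights @ values, weights      # (N, d)


rng = np.random.default_rng(0)
tokens = rng.standard_normal((1024, 32))   # N = 1024 tokens, d = 32
mem_k = rng.standard_normal((16, 32))      # M = 16 memory slots
mem_v = rng.standard_normal((16, 32))
out, attn = fixed_memory_attention(tokens, mem_k, mem_v)
print(out.shape, attn.shape)
```

Because M stays fixed, doubling the number of tokens doubles the size of the score matrix instead of quadrupling it, which is the property the abstract refers to as linear complexity.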

Code

PaddlePaddle/PaddleSeg official

8,262

Tasks

Real-Time Semantic Segmentation

Segmentation

Semantic Segmentation

Datasets

ImageNet

Cityscapes

COCO-Stuff

CamVid

Results from the Paper

Ranked #2 on Real-Time Semantic Segmentation on CamVid

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Real-Time Semantic Segmentation	CamVid	RTFormer-Slim	mIoU	81.4	# 2
Real-Time Semantic Segmentation	CamVid	RTFormer-Slim	Frame (fps)	190.7 (2080Ti)	# 18
Semantic Segmentation	CamVid	RTFormer-Base	Mean IoU	82.5	# 3
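For readers unfamiliar with the mIoU metric reported above, here is a minimal sketch of how mean intersection-over-union is commonly computed from a confusion matrix. It is not the benchmark's evaluation code. The function name `mean_iou` and the `ignore_index=255` convention are assumptions made for illustration.

```python
import numpy as np


def mean_iou(pred, target, num_classes, ignore_index=255):
    """Mean IoU over the classes that occur in pred or target.

    pred, target: integer label arrays of the same shape.
    Pixels whose target equals ignore_index (an assumed convention)
    are excluded from the computation.
    """
    pred = np.asarray(pred).ravel()
    target = np.asarray(target).ravel()
    valid = target != ignore_index
    pred, target = pred[valid], target[valid]

    # Confusion matrix: rows are ground-truth classes, columns are predictions.
    cm = np.bincount(
        target * num_classes + pred, minlength=num_classes ** 2
    ).reshape(num_classes, num_classes)

    intersection = np.diag(cm)
    union = cm.sum(axis=0) + cm.sum(axis=1) - intersection
    present = union > 0                    # skip classes that never appear
    return float((intersection[present] / union[present]).mean())


pred = np.array([[0, 1], [1, 1]])
target = np.array([[0, 1], [0, 255]])      # last pixel is ignored
print(mean_iou(pred, target, num_classes=2))
```

Leaderboards usually report the value as a percentage and typically accumulate the confusion matrix over the whole test set before taking the ratio, rather than averaging per-image scores.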
