Segmentation Transformer: Object-Contextual Representations for Semantic Segmentation

ECCV 2020 · Yuhui Yuan, Xiaokang Chen, Xilin Chen, Jingdong Wang

In this paper, we address the semantic segmentation problem with a focus on the context aggregation strategy. Our motivation is that the label of a pixel is the category of the object that the pixel belongs to. We present a simple yet effective approach, object-contextual representations, characterizing a pixel by exploiting the representation of the corresponding object class. First, we learn object regions under the supervision of the ground-truth segmentation. Second, we compute the object region representation by aggregating the representations of the pixels lying in the object region. Last, we compute the relation between each pixel and each object region, and augment the representation of each pixel with the object-contextual representation, which is a weighted aggregation of all the object region representations according to their relations with the pixel. We empirically demonstrate that the proposed approach achieves competitive performance on various challenging semantic segmentation benchmarks: Cityscapes, ADE20K, LIP, PASCAL-Context, and COCO-Stuff. Our submission "HRNet + OCR + SegFix" ranked 1st on the Cityscapes leaderboard at the time of submission. Code is available at: https://git.io/openseg and https://git.io/HRNet.OCR. We rephrase the object-contextual representation scheme using the Transformer encoder-decoder framework. The details are presented in Section 3.3.
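The three steps in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the function name `object_contextual_representations` is hypothetical, the region scores are taken as given rather than learned, and the dot-product similarity, the two softmax normalizations, and the final concatenation are assumptions made for this example. The learned transformations a real network would apply are omitted.

```python
import math


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def object_contextual_representations(pixels, region_logits):
    """pixels: N feature vectors of length C; region_logits: N rows of K scores.

    Returns N vectors of length 2C: the pixel feature followed by its
    object-contextual representation.
    """
    num_regions = len(region_logits[0])
    channels = len(pixels[0])

    # First: soft object regions. The scores are inputs here (in the paper they
    # are learned under ground-truth supervision). Assumed normalization:
    # softmax over pixels, so each region's weights sum to 1.
    region_weights = [
        softmax([row[k] for row in region_logits]) for k in range(num_regions)
    ]

    # Second: each region representation is the weighted sum of pixel features.
    regions = [
        [sum(w * px[c] for w, px in zip(region_weights[k], pixels))
         for c in range(channels)]
        for k in range(num_regions)
    ]

    augmented = []
    for px in pixels:
        # Last: pixel-region relation (assumed: dot product, softmax over regions).
        relation = softmax(
            [sum(a * b for a, b in zip(px, region)) for region in regions]
        )
        # Object-contextual representation: relation-weighted sum of regions.
        context = [
            sum(r * region[c] for r, region in zip(relation, regions))
            for c in range(channels)
        ]
        # Augment the pixel feature (assumed: simple concatenation).
        augmented.append(list(px) + context)
    return augmented
```

For example, calling it with three 2-channel pixels and two regions, `object_contextual_representations([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]], [[4.0, -4.0], [4.0, -4.0], [-4.0, 4.0]])`, returns three 4-channel vectors in which the context half of each pixel leans toward the region that pixel resembles most.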

Code

Repository	Stars
HRNet/HRNet-Semantic-Segmentation (official)	3,051
PaddlePaddle/PaddleSeg	8,256
open-mmlab/mmsegmentation	7,408
openseg-group/openseg.pytorch	1,174
mindspore-ai/models	219

Tasks

Object • Segmentation • Semantic Segmentation

Datasets

MS COCO • Cityscapes • ADE20K • PASCAL Context • COCO-Stuff • PASCAL VOC 2012 test • Mapillary Vistas Dataset • LIP

Results from the Paper

Ranked #5 on Semantic Segmentation on LIP val

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Semantic Segmentation	ADE20K	OCR (HRNetV2-W48)	Validation mIoU	45.66	# 179
Semantic Segmentation	ADE20K	HRNetV2 + OCR + RMI (PaddleClas pretrained)	Validation mIoU	47.98	# 147
Semantic Segmentation	ADE20K	OCR (ResNet-101)	Validation mIoU	45.28	# 185
Semantic Segmentation	ADE20K val	HRNetV2 + OCR + RMI (PaddleClas pretrained)	mIoU	47.98	# 59
Semantic Segmentation	ADE20K val	OCR (ResNet-101)	mIoU	45.28	# 76
Semantic Segmentation	ADE20K val	OCR (HRNetV2-W48)	mIoU	45.66	# 74
Semantic Segmentation	Cityscapes test	HRNetV2 + OCR +	Mean IoU (class)	84.5%	# 9
Semantic Segmentation	Cityscapes test	OCR (HRNetV2-W48, coarse)	Mean IoU (class)	83.0%	# 20
Semantic Segmentation	Cityscapes test	OCR (ResNet-101)	Mean IoU (class)	81.8%	# 34
Semantic Segmentation	Cityscapes test	HRNetV2 + OCR (w/ ASP)	Mean IoU (class)	83.7%	# 12
Semantic Segmentation	Cityscapes test	OCR (ResNet-101, coarse)	Mean IoU (class)	82.4%	# 27
Semantic Segmentation	Cityscapes val	HRNetV2 + OCR + RMI (PaddleClas pretrained)	mIoU	83.6	# 19
Semantic Segmentation	Cityscapes val	OCR (ResNet-101-FCN)	mIoU	80.6	# 41
Semantic Segmentation	COCO-Stuff test	OCR (HRNetV2-W48)	mIoU	40.5%	# 10
Semantic Segmentation	COCO-Stuff test	OCR (ResNet-101)	mIoU	39.5%	# 14
Semantic Segmentation	COCO-Stuff test	HRNetV2 + OCR + RMI (PaddleClas pretrained)	mIoU	45.2%	# 7
Semantic Segmentation	LIP val	HRNetV2 + OCR + RMI (PaddleClas pretrained)	mIoU	58.2%	# 5
Semantic Segmentation	LIP val	OCR (HRNetV2-W48)	mIoU	56.65%	# 6
Semantic Segmentation	LIP val	OCR (ResNet-101)	mIoU	55.6%	# 8
Semantic Segmentation	PASCAL Context	HRNetV2 + OCR + RMI (PaddleClas pretrained)	mIoU	59.6	# 14
Semantic Segmentation	PASCAL Context	OCR (ResNet-101)	mIoU	54.8	# 30
Semantic Segmentation	PASCAL Context	OCR (HRNetV2-W48)	mIoU	56.2	# 21
Semantic Segmentation	PASCAL VOC 2012 test	OCR (ResNet-101)	Mean IoU	84.3%	# 13
Semantic Segmentation	PASCAL VOC 2012 test	OCR (HRNetV2-W48)	Mean IoU	84.5%	# 12
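
Every row above reports a mean intersection-over-union (mIoU) score. As a rough illustration of the metric, the sketch below computes it from flat lists of predicted and ground-truth labels. The function name `mean_iou`, the `ignore_index=255` convention for unlabeled pixels, and the choice to skip classes that never occur are assumptions of this example; each benchmark defines its own evaluation protocol, which is not reproduced here.

```python
def mean_iou(pred, target, num_classes, ignore_index=255):
    """Mean IoU over classes for flat lists of predicted and ground-truth labels."""
    intersection = [0] * num_classes
    union = [0] * num_classes
    for p, t in zip(pred, target):
        if t == ignore_index:
            continue  # skip unlabeled pixels (assumed convention)
        if p == t:
            # True positive: counts once toward intersection and union.
            intersection[t] += 1
            union[t] += 1
        else:
            # False negative for class t, false positive for class p.
            union[t] += 1
            union[p] += 1
    # Average only over classes that appear in the prediction or the ground truth.
    ious = [i / u for i, u in zip(intersection, union) if u > 0]
    return sum(ious) / len(ious) if ious else float("nan")
```

With `pred = [0, 0, 1, 1]` and `target = [0, 1, 1, 255]`, the last pixel is ignored, each class has one correct pixel out of a union of two, and the result is 0.5.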

Methods

1x1 Convolution • Average Pooling • Batch Normalization • Bottleneck Residual Block • Convolution • Global Average Pooling • Kaiming Initialization • Max Pooling • ReLU • Residual Block • Residual Connection • ResNet
