Semantic Segmentation

5248 papers with code • 125 benchmarks • 312 datasets

Semantic Segmentation is a computer vision task in which every pixel in an image is assigned a class label, producing a dense pixel-wise segmentation map of the image. Some example benchmarks for this task are Cityscapes, PASCAL VOC and ADE20K. Models are usually evaluated with the Mean Intersection-Over-Union (Mean IoU) and Pixel Accuracy metrics.

(Image credit: CSAILVision)
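
The two metrics named in the task description, Mean IoU and Pixel Accuracy, can both be derived from a per-class confusion matrix. The following is a minimal pure-Python sketch, not the evaluation code of any particular benchmark. The function names, the `ignore_index=255` default, and the choice to skip classes that appear in neither the prediction nor the ground truth are assumptions made for illustration; individual benchmarks define their own conventions.

```python
from typing import List, Sequence

LabelMap = Sequence[Sequence[int]]  # 2D grid of integer class ids


def confusion_matrix(pred: LabelMap, target: LabelMap, num_classes: int,
                     ignore_index: int = 255) -> List[List[int]]:
    """Count pixels for each (ground-truth class, predicted class) pair.

    Pixels whose ground-truth label equals ignore_index are skipped
    (assumed convention). Predictions must lie in [0, num_classes).
    """
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if t == ignore_index:
                continue
            matrix[t][p] += 1
    return matrix


def pixel_accuracy(matrix: List[List[int]]) -> float:
    """Fraction of counted pixels whose predicted class is correct."""
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[c][c] for c in range(len(matrix)))
    return correct / total if total else 0.0


def mean_iou(matrix: List[List[int]]) -> float:
    """Average of per-class IoU = TP / (TP + FP + FN).

    Classes absent from both prediction and ground truth are skipped
    (assumed convention).
    """
    n = len(matrix)
    ious = []
    for c in range(n):
        tp = matrix[c][c]
        fn = sum(matrix[c]) - tp                        # class c missed
        fp = sum(matrix[r][c] for r in range(n)) - tp   # wrongly predicted c
        union = tp + fp + fn
        if union > 0:
            ious.append(tp / union)
    return sum(ious) / len(ious) if ious else 0.0


if __name__ == "__main__":
    target = [[0, 0, 1],
              [1, 1, 2]]
    pred = [[0, 1, 1],
            [1, 1, 2]]
    m = confusion_matrix(pred, target, num_classes=3)
    print(pixel_accuracy(m))  # 5 of 6 pixels correct, about 0.833
    print(mean_iou(m))        # (1/2 + 3/4 + 1) / 3 = 0.75
```

In the toy example, one pixel of class 0 is mislabeled as class 1, which lowers the IoU of both classes; this is why Mean IoU (0.75) penalizes the error more than Pixel Accuracy (about 0.833).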

Benchmarks

These leaderboards are used to track progress in Semantic Segmentation.

Dataset	Best Model
ADE20K	ONE-PEACE
NYU Depth v2	OmniVec
Cityscapes test	VLTSeg
ADE20K val	BEiT-3
Cityscapes val	SERNet-Former
PASCAL Context	PlainSeg (EVA-02-L)
S3DIS	PTv3 + PPT
S3DIS Area5	OmniVec
PASCAL VOC 2012 test	DeepLabv3+ (Xception-65-JFT)
SUN-RGBD	TokenFusion (S)
DensePASS	Trans4PASS+ (multi-scale)
ScanNet	PTv3 + PPT
PASCAL VOC 2012 val	EfficientNet-L2+NAS-FPN (single scale test, with self-training)
DADA-seg	MMUDA
Stanford2D3D Panoramic	SFSS-MMSI (RGB+HHA)
ImageNet-S	TEC (ViT-B/16, 224x224, SSL+FT, mmseg)
LaRS	SWIM^2 (Mask2Former)
CamVid	SERNet-Former
COCO-Stuff test	EVA
iSAID	SegNeXt-L
Semantic3D	Feature Geometric Net
ISPRS Potsdam	AerialFormer-B
Trans10K	Trans4Trans (M)
Dark Zurich	Refign (HRDA)
KITTI-360	CMNeXt (RGB-D-E-LiDAR)
MCubeS	MMSFormer (RGB-A-D-N)
DeLiVER	CMNeXt (RGB-D-E-LiDAR)
UrbanLF	CMNeXt (RGB-LF80)
LIP val	Hulk(Finetune, ViT-L)
ScanNetV2	CMX
GTAV-to-Cityscapes Labels	MIC
Nighttime Driving	TADP
LoveDA	ViT-G12X4
EventScape	CMX (B4)
FMB Dataset	MMSFormer (RGB-Infrared)
ISPRS Vaihingen	LSKNet-S
SpaceNet 1	MAE+MTP(ViT-L)
ZJU-RGB-P	ShareCMP (B4 RGB-FP)
INRIA Aerial Image Labeling	UANet(PVT-V2-B2)
LLRGBD-synthetic	SMMCL (SegNeXt-B)
UPLight	ShareCMP (B2 RGB-FP)
MCubeS (P)	MMSFormer (RGB-A-D)
SpectralWaste	CMX (RGB-HYPER)
DDD17	CMNeXt
DSEC	CMNeXt
KITTI Semantic Segmentation	RPVNet [xu2021rpvnet]
SkyScapes-Dense	SkyScapesNet-Dense
FoodSeg103	FoodSAM
SYNTHIA-to-Cityscapes	HRDA + PiPa
SynPASS	Trans4PASS+
SELMA	CMX
Pothole Mix	Baseline - DeepLabv3+
DELIVER	CMNeXt (RGB-D-E-LiDAR)
Mapillary val	AO-SegNet
MS COCO	OneFormer (InternImage-H, emb_dim=1024, single-scale)
Stanford2D3D - RGBD	CMX (SegFormer-B4)
Event-based Segmentation Dataset	Bimodal SegNet
GAMUS	TIMF
ACDC Scribbles	ScribFormer
ShapeNet	PatchFormer
UAVid	LSKNet-S
BIG	PSPNet + CascadePSP
PETRAW	NCC Next
Hypersim	MultiMAE (ViT-B)
Structured3D	SFSS-MMSI (RGB+Depth+Normal)
Matterport3D	SFSS-MMSI (RGB+Depth)
CC3M-TagMask	TTD (TCL)
PASCAL VOC 2011 test	Plugin network
RELLIS-3D Dataset	GA-Nav
PASTIS	Exchanger+Mask2Former
SIFT-flow	RBE2E
Stanford2D3D Panoramic - RGBD	CBFC
Toronto-3D L002	SCF-Net
Montgomery County X-ray Set	UNETR + SS-CXR
dacl10k v1 testdev	FPN EfficientNet-B4 w/ Aux loss
SYNTHIA-CVPR’16	SSMA
Freiburg Forest	SSMA
38-Cloud	Cloud-Net+
PASCAL VOC 2007	GALDNet
SkyScapes-Lane	SkyScapesNet-Lane
Kvasir-Instrument	UNet
Graz-02	VOLO-D5
Cleargrasp (Novel)	Cleargrasp
Cityscapes	SPFNet34M
Endoscapes	MoCo V2 Surg SSL - DeepLabv3+ head
HERA RFI Detection	Nearest Latent Neighbours
LOFAR RFI Detection	Nearest Latent Neighbours
BDD	FasterSeg
COCO-Stuff	Deeplab v2
Cam2BEV	uNetXST
ApolloScape	ERFNet-IntRA-KD (ours)
DroneDeploy	DLv3+ (Xception65)
ManipalUAVid	UVid-Net
Cityscapes VIPriors subset	EfficientSeg
SBCoseg	Dice loss + IS-Triplet loss
PASCAL VOC 2010 test	SIW
PASCAL VOC 2012	DLDL-8s+CRF
COCO-Stuff full	SegFormer-B5 (Single Scale)
PASCAL VOC 2011	DLDL-8s+CRF
AIRS	ICT-Net
WildDash	SIW
OpenEDS	RITnet
SYNTHIA	CGA-Net
PASCAL VOC	SegCLIP
UTFPR-SBD3	EPYNET
DIVA-HisDB	U-Net
ATLANTIS	Erfani et al.
PH2	MFSNet
ISIC 2017	MFSNet
HAM10000	MFSNet
Mila Simulated Floods	FloodTransformer (Ours)
SWIMSEG	ACLNet
SWINSEG	ACLNet
SWINySEG	ACLNet
MixedWM38	WaferSegClassNet
BDD100K val	NiseNet
PASTIS-R	Late Fusion
Cityscapes 3D	TaskPrompter
FLAIR (French Land cover from Aerospace ImageRy)	U-Net baseline
RUGD	GA-Nav
dacl10k v1 testfinal	FPN EfficientNet-B4
SemanticPOSS	TFNet
COCO-Stuff-27	DiffSeg (512)
Forward-Looking Sonar Marine Debris Datasets	Unet+RN34
STARE	UNet

Libraries

Use these libraries to find Semantic Segmentation models and implementations

Library	Papers	GitHub stars
PaddlePaddle/PaddleSeg	53	8,282
rwightman/pytorch-image-models	33	29,908
osmr/imgclsmob	30	2,924
open-mmlab/mmsegmentation	19	7,454

See all 39 libraries.

Subtasks

Weakly-Supervised Semantic Segmentation

Scene Segmentation

Semi-Supervised Semantic Segmentation

Real-Time Semantic Segmentation

3D Part Segmentation

Unsupervised Semantic Segmentation

Road Segmentation

One-Shot Segmentation

Bird's-Eye View Semantic Segmentation

Crack Segmentation

UNET Segmentation

Universal Segmentation

Class-Incremental Semantic Segmentation

Polyp Segmentation

Vision-Language Segmentation

4D Spatio Temporal Semantic Segmentation

Histopathological Segmentation

Attentive segmentation networks

Text-Line Extraction

Aerial Video Semantic Segmentation

Amodal Panoptic Segmentation

Robust BEV Map Segmentation

Latest papers

GLIMS: Attention-Guided Lightweight Multi-Scale Hybrid Network for Volumetric Semantic Segmentation

yaziciz/GLIMS • 27 Apr 2024

Notably, GLIMS achieved this high performance with a significantly reduced number of trainable parameters.

Multi-Scale Representations by Varying Window Attention for Semantic Segmentation

yan-hao-tian/lawin • 110 stars • 25 Apr 2024

VWA leverages the local window attention (LWA) and disentangles LWA into the query window and context window, allowing the context's scale to vary for the query to learn representations at multiple scales.

A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic Segmentation

emi-group/evoxbench • 25 Apr 2024

To bridge the gap, we introduce a tailored streamline to transform the task of HW-NAS for real-time semantic segmentation into standard MOPs.

Multimodal Information Interaction for Medical Image Segmentation

fxxjuses/micformer • 25 Apr 2024

To address this issue, we introduce an innovative Multimodal Information Cross Transformer (MicFormer), which employs a dual-stream architecture to simultaneously extract features from each modality.

Boosting Unsupervised Semantic Segmentation with Principal Mask Proposals

visinf/primaps • 25 Apr 2024

Unsupervised semantic segmentation aims to automatically partition images into semantically meaningful regions by identifying global categories within an image corpus without any form of annotation.

Self-Balanced R-CNN for Instance Segmentation

IMPLabUniPr/mmdetection • 25 Apr 2024

Current state-of-the-art two-stage models on instance segmentation task suffer from several types of imbalances.

Auto-Generating Weak Labels for Real & Synthetic Data to Improve Label-Scarce Medical Image Segmentation

stanfordmlgroup/auto-generate-wls • 25 Apr 2024

The high cost of creating pixel-by-pixel gold-standard labels, limited expert availability, and presence of diverse tasks make it challenging to generate segmentation labels to train deep learning models for medical imaging tasks.

OMEGAS: Object Mesh Extraction from Large Scenes Guided by Gaussian Segmentation

crystalwlz/omegas • 24 Apr 2024

Current scene reconstruction techniques frequently result in the loss of object detail textures and are unable to reconstruct object portions that are occluded or unseen in views.

Vision Transformer-based Adversarial Domain Adaptation

lluckyyh/vt-ada • 24 Apr 2024

Unsupervised domain adaptation (UDA) aims to transfer knowledge from a labeled source domain to an unlabeled target domain.

Surgical-DeSAM: Decoupling SAM for Instrument Segmentation in Robotic Surgery

yuyangsheng/surgical-desam • 22 Apr 2024

We utilise a commonly used detection architecture, DETR, and fine-tuned it to obtain bounding box prompt for the instruments.
