TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Semi-Supervised Semantic Segmentation	ADE20K 1/16 labeled	SemiVL	Validation mIoU	37.2	# 1
Semi-Supervised Semantic Segmentation	ADE20K 1/32 labeled	SemiVL	Validation mIoU	35.1	# 1
Semi-Supervised Semantic Segmentation	Cityscapes 100 samples labeled	SemiVL (ViT-B/16)	Validation mIoU	76.2	# 1
Semi-Supervised Semantic Segmentation	Cityscapes 12.5% labeled	SemiVL (ViT-B/16)	Validation mIoU	79.4%	# 1
Semi-Supervised Semantic Segmentation	Cityscapes 25% labeled	SemiVL (ViT-B/16)	Validation mIoU	80.3%	# 1
Semi-Supervised Semantic Segmentation	Cityscapes 50% labeled	SemiVL (ViT-B/16)	Validation mIoU	80.6%	# 1
Semi-Supervised Semantic Segmentation	Cityscapes 6.25% labeled	SemiVL (ViT-B/16)	Validation mIoU	77.9	# 1
Semi-Supervised Semantic Segmentation	COCO 1/128 labeled	SemiVL	Validation mIoU	53.6	# 1
Semi-Supervised Semantic Segmentation	COCO 1/256 labeled	SemiVL	Validation mIoU	52.8	# 1
Semi-Supervised Semantic Segmentation	COCO 1/32 labeled	SemiVL	Validation mIoU	56.5	# 1
Semi-Supervised Semantic Segmentation	COCO 1/512 labeled	SemiVL	Validation mIoU	50.1	# 1
Semi-Supervised Semantic Segmentation	COCO 1/64 labeled	SemiVL	Validation mIoU	55.4	# 1
Semi-Supervised Semantic Segmentation	PASCAL VOC 2012 1464 labels	SemiVL (ViT-B/16	Validation mIoU	87.3	# 1
Semi-Supervised Semantic Segmentation	PASCAL VOC 2012 1464 labels	UniMatch (ViT-B/16)	Validation mIoU	84.0	# 2
Semi-Supervised Semantic Segmentation	PASCAL VOC 2012 183 labeled	SemiVL (ViT-B/16)	Validation mIoU	85.6	# 1
Semi-Supervised Semantic Segmentation	PASCAL VOC 2012 183 labeled	UniMatch (ViT-B/16)	Validation mIoU	80.1	# 2
Semi-Supervised Semantic Segmentation	PASCAL VOC 2012 366 labeled	UniMatch (ViT-B/16)	Validation mIoU	82.0	# 2
Semi-Supervised Semantic Segmentation	PASCAL VOC 2012 366 labeled	SemiVL (ViT-B/16)	Validation mIoU	86.0	# 1
Semi-Supervised Semantic Segmentation	PASCAL VOC 2012 732 labeled	SemiVL (ViT-B/16)	Validation mIoU	86.7	# 1
Semi-Supervised Semantic Segmentation	PASCAL VOC 2012 732 labeled	UniMatch (ViT-B/16)	Validation mIoU	83.3	# 2
Semi-Supervised Semantic Segmentation	PASCAL VOC 2012 92 labeled	SemiVL (ViT-B/16)	Validation mIoU	84.0	# 1
Semi-Supervised Semantic Segmentation	PASCAL VOC 2012 92 labeled	UniMatch (ViT-B/16)	Validation mIoU	77.9	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semivl-semi-supervised-semantic-segmentation/semi-supervised-semantic-segmentation-on-42)](https://paperswithcode.com/sota/semi-supervised-semantic-segmentation-on-42?p=semivl-semi-supervised-semantic-segmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semivl-semi-supervised-semantic-segmentation/semi-supervised-semantic-segmentation-on-41)](https://paperswithcode.com/sota/semi-supervised-semantic-segmentation-on-41?p=semivl-semi-supervised-semantic-segmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semivl-semi-supervised-semantic-segmentation/semi-supervised-semantic-segmentation-on-3)](https://paperswithcode.com/sota/semi-supervised-semantic-segmentation-on-3?p=semivl-semi-supervised-semantic-segmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semivl-semi-supervised-semantic-segmentation/semi-supervised-semantic-segmentation-on-2)](https://paperswithcode.com/sota/semi-supervised-semantic-segmentation-on-2?p=semivl-semi-supervised-semantic-segmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semivl-semi-supervised-semantic-segmentation/semi-supervised-semantic-segmentation-on-1)](https://paperswithcode.com/sota/semi-supervised-semantic-segmentation-on-1?p=semivl-semi-supervised-semantic-segmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semivl-semi-supervised-semantic-segmentation/semi-supervised-semantic-segmentation-on-8)](https://paperswithcode.com/sota/semi-supervised-semantic-segmentation-on-8?p=semivl-semi-supervised-semantic-segmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semivl-semi-supervised-semantic-segmentation/semi-supervised-semantic-segmentation-on-22)](https://paperswithcode.com/sota/semi-supervised-semantic-segmentation-on-22?p=semivl-semi-supervised-semantic-segmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semivl-semi-supervised-semantic-segmentation/semi-supervised-semantic-segmentation-on-coco-2)](https://paperswithcode.com/sota/semi-supervised-semantic-segmentation-on-coco-2?p=semivl-semi-supervised-semantic-segmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semivl-semi-supervised-semantic-segmentation/semi-supervised-semantic-segmentation-on-coco-1)](https://paperswithcode.com/sota/semi-supervised-semantic-segmentation-on-coco-1?p=semivl-semi-supervised-semantic-segmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semivl-semi-supervised-semantic-segmentation/semi-supervised-semantic-segmentation-on-coco-4)](https://paperswithcode.com/sota/semi-supervised-semantic-segmentation-on-coco-4?p=semivl-semi-supervised-semantic-segmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semivl-semi-supervised-semantic-segmentation/semi-supervised-semantic-segmentation-on-coco)](https://paperswithcode.com/sota/semi-supervised-semantic-segmentation-on-coco?p=semivl-semi-supervised-semantic-segmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semivl-semi-supervised-semantic-segmentation/semi-supervised-semantic-segmentation-on-coco-3)](https://paperswithcode.com/sota/semi-supervised-semantic-segmentation-on-coco-3?p=semivl-semi-supervised-semantic-segmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semivl-semi-supervised-semantic-segmentation/semi-supervised-semantic-segmentation-on-10)](https://paperswithcode.com/sota/semi-supervised-semantic-segmentation-on-10?p=semivl-semi-supervised-semantic-segmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semivl-semi-supervised-semantic-segmentation/semi-supervised-semantic-segmentation-on-28)](https://paperswithcode.com/sota/semi-supervised-semantic-segmentation-on-28?p=semivl-semi-supervised-semantic-segmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semivl-semi-supervised-semantic-segmentation/semi-supervised-semantic-segmentation-on-29)](https://paperswithcode.com/sota/semi-supervised-semantic-segmentation-on-29?p=semivl-semi-supervised-semantic-segmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semivl-semi-supervised-semantic-segmentation/semi-supervised-semantic-segmentation-on-30)](https://paperswithcode.com/sota/semi-supervised-semantic-segmentation-on-30?p=semivl-semi-supervised-semantic-segmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semivl-semi-supervised-semantic-segmentation/semi-supervised-semantic-segmentation-on-27)](https://paperswithcode.com/sota/semi-supervised-semantic-segmentation-on-27?p=semivl-semi-supervised-semantic-segmentation)`

SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance

27 Nov 2023 · Lukas Hoyer, David Joseph Tan, Muhammad Ferjad Naeem, Luc van Gool, Federico Tombari ·

In semi-supervised semantic segmentation, a model is trained with a limited number of labeled images along with a large corpus of unlabeled images to reduce the high annotation effort. While previous methods are able to learn good segmentation boundaries, they are prone to confuse classes with similar visual appearance due to the limited supervision. On the other hand, vision-language models (VLMs) are able to learn diverse semantic knowledge from image-caption datasets but produce noisy segmentation due to the image-level training. In SemiVL, we propose to integrate rich priors from VLM pre-training into semi-supervised semantic segmentation to learn better semantic decision boundaries. To adapt the VLM from global to local reasoning, we introduce a spatial fine-tuning strategy for label-efficient learning. Further, we design a language-guided decoder to jointly reason over vision and language. Finally, we propose to handle inherent ambiguities in class labels by providing the model with language guidance in the form of class definitions. We evaluate SemiVL on 4 semantic segmentation datasets, where it significantly outperforms previous semi-supervised methods. For instance, SemiVL improves the state-of-the-art by +13.5 mIoU on COCO with 232 annotated images and by +6.1 mIoU on Pascal VOC with 92 labels. Project page: https://github.com/google-research/semivl

PDF Abstract

Code

Add Remove Mark official

google-research/semivl official

Tasks

Add Remove

Segmentation

Semantic Segmentation

Semi-Supervised Semantic Segmentation

Datasets

Cityscapes

ADE20K

Results from the Paper

Edit

Ranked #1 on Semi-Supervised Semantic Segmentation on PASCAL VOC 2012 732 labeled (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Semi-Supervised Semantic Segmentation	ADE20K 1/16 labeled	SemiVL	Validation mIoU	37.2	# 1	Compare
Semi-Supervised Semantic Segmentation	ADE20K 1/32 labeled	SemiVL	Validation mIoU	35.1	# 1	Compare
Semi-Supervised Semantic Segmentation	Cityscapes 100 samples labeled	SemiVL (ViT-B/16)	Validation mIoU	76.2	# 1	Compare
Semi-Supervised Semantic Segmentation	Cityscapes 12.5% labeled	SemiVL (ViT-B/16)	Validation mIoU	79.4%	# 1	Compare
Semi-Supervised Semantic Segmentation	Cityscapes 25% labeled	SemiVL (ViT-B/16)	Validation mIoU	80.3%	# 1	Compare
Semi-Supervised Semantic Segmentation	Cityscapes 50% labeled	SemiVL (ViT-B/16)	Validation mIoU	80.6%	# 1	Compare
Semi-Supervised Semantic Segmentation	Cityscapes 6.25% labeled	SemiVL (ViT-B/16)	Validation mIoU	77.9	# 1	Compare
Semi-Supervised Semantic Segmentation	COCO 1/128 labeled	SemiVL	Validation mIoU	53.6	# 1	Compare
Semi-Supervised Semantic Segmentation	COCO 1/256 labeled	SemiVL	Validation mIoU	52.8	# 1	Compare
Semi-Supervised Semantic Segmentation	COCO 1/32 labeled	SemiVL	Validation mIoU	56.5	# 1	Compare
Semi-Supervised Semantic Segmentation	COCO 1/512 labeled	SemiVL	Validation mIoU	50.1	# 1	Compare
Semi-Supervised Semantic Segmentation	COCO 1/64 labeled	SemiVL	Validation mIoU	55.4	# 1	Compare
Semi-Supervised Semantic Segmentation	PASCAL VOC 2012 1464 labels	SemiVL (ViT-B/16	Validation mIoU	87.3	# 1	Compare
Semi-Supervised Semantic Segmentation	PASCAL VOC 2012 1464 labels	UniMatch (ViT-B/16)	Validation mIoU	84.0	# 2	Compare
Semi-Supervised Semantic Segmentation	PASCAL VOC 2012 183 labeled	SemiVL (ViT-B/16)	Validation mIoU	85.6	# 1	Compare
Semi-Supervised Semantic Segmentation	PASCAL VOC 2012 183 labeled	UniMatch (ViT-B/16)	Validation mIoU	80.1	# 2	Compare
Semi-Supervised Semantic Segmentation	PASCAL VOC 2012 366 labeled	UniMatch (ViT-B/16)	Validation mIoU	82.0	# 2	Compare
Semi-Supervised Semantic Segmentation	PASCAL VOC 2012 366 labeled	SemiVL (ViT-B/16)	Validation mIoU	86.0	# 1	Compare
Semi-Supervised Semantic Segmentation	PASCAL VOC 2012 732 labeled	SemiVL (ViT-B/16)	Validation mIoU	86.7	# 1	Compare
Semi-Supervised Semantic Segmentation	PASCAL VOC 2012 732 labeled	UniMatch (ViT-B/16)	Validation mIoU	83.3	# 2	Compare
Semi-Supervised Semantic Segmentation	PASCAL VOC 2012 92 labeled	SemiVL (ViT-B/16)	Validation mIoU	84.0	# 1	Compare
Semi-Supervised Semantic Segmentation	PASCAL VOC 2012 92 labeled	UniMatch (ViT-B/16)	Validation mIoU	77.9	# 2	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove