TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Generalized Referring Expression Segmentation	gRefCOCO	CRIS	gIoU	56.27	# 4
Generalized Referring Expression Segmentation	gRefCOCO	CRIS	cIoU	55.34	# 3
Referring Expression Segmentation	RefCOCO testA	CRIS	Overall IoU	73.18	# 13
Referring Expression Segmentation	RefCOCO+ testA	CRIS	Overall IoU	68.08	# 11
Referring Expression Segmentation	RefCOCO testB	CRIS	Overall IoU	66.1	# 11
Referring Expression Segmentation	RefCOCO+ testB	CRIS	Overall IoU	53.68	# 12
Referring Expression Segmentation	RefCOCO val	CRIS	Overall IoU	70.47	# 14
Referring Expression Segmentation	RefCOCO val	CRIS	Overall IoU	70.47	# 10
Referring Expression Segmentation	RefCOCO+ val	CRIS	Overall IoU	62.27	# 12
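
The table above reports Overall IoU for the RefCOCO splits and gIoU/cIoU for gRefCOCO (values are percentages). As a rough illustration of how such scores are commonly computed, the sketch below assumes that Overall IoU and cIoU mean cumulative intersection over cumulative union across all samples, and that gIoU means per-sample IoU averaged over samples, with an empty prediction on an empty target scored as 1. These definitions and the function names are assumptions for illustration; they are not taken from this page or from the CRIS evaluation code.

```python
from typing import Sequence, Tuple

Mask = Sequence[Sequence[int]]  # binary mask given as rows of 0/1 values


def intersection_and_union(pred: Mask, target: Mask) -> Tuple[int, int]:
    """Count pixels in the intersection and in the union of two binary masks."""
    inter = 0
    union = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            inter += 1 if (p and t) else 0
            union += 1 if (p or t) else 0
    return inter, union


def overall_iou(preds: Sequence[Mask], targets: Sequence[Mask]) -> float:
    """Cumulative IoU: total intersection divided by total union over the set."""
    total_inter = 0
    total_union = 0
    for pred, target in zip(preds, targets):
        inter, union = intersection_and_union(pred, target)
        total_inter += inter
        total_union += union
    return total_inter / total_union if total_union else 0.0


def mean_iou(preds: Sequence[Mask], targets: Sequence[Mask]) -> float:
    """Per-sample IoU averaged over samples.

    Assumption: a sample whose prediction and target are both empty scores 1.0.
    """
    scores = []
    for pred, target in zip(preds, targets):
        inter, union = intersection_and_union(pred, target)
        scores.append(inter / union if union else 1.0)
    return sum(scores) / len(scores) if scores else 0.0


# Two toy 2x2 samples: the first prediction over-segments by one pixel.
example_preds = [[[1, 1], [0, 0]], [[1, 0], [0, 0]]]
example_targets = [[[1, 0], [0, 0]], [[1, 0], [0, 0]]]
example_overall = overall_iou(example_preds, example_targets)  # 2 / 3
example_mean = mean_iou(example_preds, example_targets)        # (0.5 + 1.0) / 2
```

The two numbers differ because the cumulative form weights each sample by its pixel count, while the averaged form weights every sample equally.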

CRIS: CLIP-Driven Referring Image Segmentation

CVPR 2022 · Zhaoqing Wang, Yu Lu, Qiang Li, Xunqiang Tao, Yandong Guo, Mingming Gong, Tongliang Liu

Referring image segmentation aims to segment a referent via a natural linguistic expression. Due to the distinct data properties of text and images, it is challenging for a network to align text and pixel-level features well. Existing approaches use pretrained models to facilitate learning, yet transfer the language and vision knowledge from pretrained models separately, ignoring the multi-modal correspondence information. Inspired by recent advances in Contrastive Language-Image Pretraining (CLIP), in this paper we propose an end-to-end CLIP-Driven Referring Image Segmentation framework (CRIS). To transfer the multi-modal knowledge effectively, CRIS resorts to vision-language decoding and contrastive learning to achieve text-to-pixel alignment. More specifically, we design a vision-language decoder to propagate fine-grained semantic information from textual representations to each pixel-level activation, which promotes consistency between the two modalities. In addition, we present text-to-pixel contrastive learning to explicitly enforce the text feature to be similar to the related pixel-level features and dissimilar to the irrelevant ones. The experimental results on three benchmark datasets demonstrate that our proposed framework significantly outperforms state-of-the-art methods without any post-processing. The code will be released.
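
The text-to-pixel contrastive objective described in the abstract can be illustrated with a small sketch. It assumes one plausible formulation: the similarity between the text feature and each pixel feature is a dot product passed through a sigmoid, and a binary cross-entropy term pulls pixels inside the referred region toward 1 and all other pixels toward 0. The abstract does not spell out the loss, so the formulation, the function names, and the `eps` clamp are assumptions for illustration, not the authors' implementation.

```python
import math
from typing import Sequence


def sigmoid(x: float) -> float:
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def text_to_pixel_contrastive_loss(
    text_feat: Sequence[float],
    pixel_feats: Sequence[Sequence[float]],
    mask: Sequence[int],
    eps: float = 1e-7,
) -> float:
    """Mean binary cross-entropy between sigmoid(text . pixel) and the mask.

    text_feat:   one sentence-level feature vector.
    pixel_feats: one feature vector per pixel, same dimension as text_feat.
    mask:        1 for pixels inside the referred region, 0 otherwise.
    """
    if not pixel_feats or len(pixel_feats) != len(mask):
        raise ValueError("pixel_feats and mask must be non-empty and equal in length")
    total = 0.0
    for feat, label in zip(pixel_feats, mask):
        score = sum(t * p for t, p in zip(text_feat, feat))
        # Clamp the probability so the logarithm stays finite.
        prob = min(max(sigmoid(score), eps), 1.0 - eps)
        total += -math.log(prob) if label else -math.log(1.0 - prob)
    return total / len(pixel_feats)


# Toy case: one pixel aligned with the text feature, one opposed to it.
toy_text = [1.0, 0.0]
toy_pixels = [[4.0, 0.0], [-4.0, 0.0]]
loss_matching = text_to_pixel_contrastive_loss(toy_text, toy_pixels, [1, 0])
loss_swapped = text_to_pixel_contrastive_loss(toy_text, toy_pixels, [0, 1])
```

In the toy case the loss is small when the aligned pixel is labeled as part of the referent and large when the labels are swapped, which is the behavior the abstract asks of the objective.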

Code

DerrickWang005/CRIS.pytorch official

Tasks

Contrastive Learning

Generalized Referring Expression Segmentation

Image Segmentation

Referring Expression Segmentation

Segmentation

Semantic Segmentation

Datasets

RefCOCO

gRefCOCO

Results from the Paper

Ranked #4 on Generalized Referring Expression Segmentation on gRefCOCO

Methods

Contrastive Learning
