Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation

Parameter-Efficient Tuning (PET) has attracted attention for matching full fine-tuning performance while updating far fewer parameters and saving hardware resources, but few studies investigate dense prediction tasks or the interaction between modalities. In this paper, we investigate efficient tuning for referring image segmentation. We propose Bridger, a novel adapter that facilitates cross-modal information exchange and injects task-specific information into the pre-trained model, and we design a lightweight decoder for image segmentation. Our approach achieves comparable or superior performance while updating only 1.61\% to 3.38\% of the backbone parameters, as evaluated on challenging benchmarks. The code is available at \url{https://github.com/kkakkkka/ETRIS}.
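The core idea of an adapter like Bridger can be sketched as a small trainable module that exchanges information between the frozen vision and language encoders via cross-attention and injects the result back as a residual. The sketch below is illustrative only: the class name, dimensions, and attention layout are assumptions, not the paper's exact architecture.

```python
import torch
import torch.nn as nn

class BridgerSketch(nn.Module):
    """Hypothetical sketch of a Bridger-style adapter (illustrative, not the
    paper's exact design): down-project both modalities into a small shared
    space, exchange information via cross-attention, and inject the result
    back into the frozen backbone features as a residual."""
    def __init__(self, vis_dim=768, txt_dim=512, hidden=64, heads=4):
        super().__init__()
        # small bottleneck keeps the trainable parameter count low
        self.v_down = nn.Linear(vis_dim, hidden)
        self.t_down = nn.Linear(txt_dim, hidden)
        self.t2v = nn.MultiheadAttention(hidden, heads, batch_first=True)
        self.v2t = nn.MultiheadAttention(hidden, heads, batch_first=True)
        self.v_up = nn.Linear(hidden, vis_dim)
        self.t_up = nn.Linear(hidden, txt_dim)

    def forward(self, vis, txt):
        v, t = self.v_down(vis), self.t_down(txt)
        # vision tokens query language tokens, and vice versa
        v_new, _ = self.t2v(v, t, t)
        t_new, _ = self.v2t(t, v, v)
        # residual injection preserves the frozen backbone's features
        return vis + self.v_up(v_new), txt + self.t_up(t_new)

vis = torch.randn(2, 196, 768)  # e.g. ViT patch tokens
txt = torch.randn(2, 20, 512)   # e.g. text tokens
v_out, t_out = BridgerSketch()(vis, txt)
print(v_out.shape, t_out.shape)  # shapes match the inputs
```

Because only the adapter's down/up projections and attention weights are trained while both encoders stay frozen, the fraction of updated parameters stays small, in the spirit of the 1.61%–3.38% figure reported above.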

ICCV 2023

Benchmark results (Task: Referring Expression Segmentation; Model: ETRIS)

Dataset        Metric        Value   Global Rank
RefCOCO        IoU           71.06   # 2
RefCOCO testA  Overall IoU   74.11   # 11
RefCOCO testB  Overall IoU   66.66   # 9
RefCOCO val    Overall IoU   71.06   # 12
