PhraseCut: Language-based Image Segmentation in the Wild

We consider the problem of segmenting image regions given a natural language phrase, and study it on a novel dataset of 77,262 images and 345,486 phrase-region pairs. Our dataset is collected on top of the Visual Genome dataset and uses its existing annotations to generate a challenging set of referring phrases for which the corresponding regions are manually annotated. Phrases in our dataset can correspond to multiple regions and describe a large number of object and stuff categories as well as their attributes, such as color, shape, parts, and relationships with other entities in the image. Our experiments show that the scale and diversity of concepts in our dataset pose significant challenges to existing state-of-the-art methods. We systematically handle the long-tail nature of these concepts and present a modular approach that combines category, attribute, and relationship cues and outperforms existing approaches.
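
The abstract describes phrases that combine a category, attributes, and relationships with other entities, and that may refer to several regions at once. Below is a minimal sketch of how one such phrase-region pair could be represented; all field names and values are hypothetical illustrations, not the actual PhraseCut data format.

```python
# Hypothetical representation of a single PhraseCut-style annotation
# (field names and values are illustrative only).
annotation = {
    "image_id": 2318201,                        # made-up Visual Genome image id
    "phrase": "striped umbrellas on the sand",  # referring phrase
    "category": "umbrella",                     # object / stuff category cue
    "attributes": ["striped"],                  # attribute cues (color, shape, ...)
    "relationship": ("on", "sand"),             # relationship cue to another entity
    "regions": [                                # a phrase may match multiple regions
        {"polygon": [(10, 40), (60, 40), (60, 90), (10, 90)]},
        {"polygon": [(70, 42), (120, 42), (120, 92), (70, 92)]},
    ],
}
```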

PDF · CVPR 2020 Abstract

Datasets

PhraseCut · Visual Genome

| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|------|---------|-------|-------------|--------------|-------------|
| Referring Expression Segmentation | PhraseCut | HULANet | Mean IoU | 41.3 | #4 |
| Referring Expression Segmentation | PhraseCut | HULANet | Pr@0.5 | 42.9 | #2 |
| Referring Expression Segmentation | PhraseCut | HULANet | Pr@0.7 | 27.8 | #2 |
| Referring Expression Segmentation | PhraseCut | HULANet | Pr@0.9 | 5.9 | #2 |
| Referring Expression Segmentation | PhraseCut | MattNet | Mean IoU | 20.2 | #6 |
| Referring Expression Segmentation | PhraseCut | MattNet | Pr@0.5 | 19.7 | #4 |
| Referring Expression Segmentation | PhraseCut | MattNet | Pr@0.7 | 13.5 | #3 |
| Referring Expression Segmentation | PhraseCut | MattNet | Pr@0.9 | 3.0 | #3 |
| Referring Expression Segmentation | PhraseCut | RMI | Mean IoU | 21.1 | #5 |
| Referring Expression Segmentation | PhraseCut | RMI | Pr@0.5 | 22.0 | #3 |
| Referring Expression Segmentation | PhraseCut | RMI | Pr@0.7 | 11.6 | #4 |
| Referring Expression Segmentation | PhraseCut | RMI | Pr@0.9 | 1.5 | #4 |
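
The metrics above follow standard segmentation conventions: Mean IoU averages the intersection-over-union between predicted and ground-truth masks over all phrase-region pairs, and Pr@t is the fraction of pairs whose IoU exceeds the threshold t. A minimal sketch of this evaluation is shown below, assuming binary masks as NumPy arrays; the function names are illustrative and this is not the benchmark's official evaluation code.

```python
import numpy as np

def iou(pred_mask, gt_mask):
    """Intersection-over-union between two binary masks."""
    pred = np.asarray(pred_mask, dtype=bool)
    gt = np.asarray(gt_mask, dtype=bool)
    union = np.logical_or(pred, gt).sum()
    if union == 0:
        return 0.0
    return float(np.logical_and(pred, gt).sum() / union)

def evaluate(pred_masks, gt_masks, thresholds=(0.5, 0.7, 0.9)):
    """Mean IoU over all phrase-region pairs, plus Pr@t: the fraction
    of pairs whose IoU exceeds threshold t."""
    ious = np.array([iou(p, g) for p, g in zip(pred_masks, gt_masks)])
    scores = {"Mean IoU": float(ious.mean())}
    for t in thresholds:
        scores[f"Pr@{t}"] = float((ious > t).mean())
    return scores

# Toy usage: IoU = 4/6 ≈ 0.67, so Pr@0.5 = 1.0 and Pr@0.7 = Pr@0.9 = 0.0.
gt = np.zeros((4, 4), dtype=bool); gt[:2, :2] = True
pred = np.zeros((4, 4), dtype=bool); pred[:2, :3] = True
print(evaluate([pred], [gt]))
```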

Methods


No methods listed for this paper.