Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark

5 Jun 2023 · Shuyu Yang, Yinan Zhou, Yaxiong Wang, Yujiao Wu, Li Zhu, Zhedong Zheng

In this paper, we introduce a large Multi-Attribute and Language Search dataset for text-based person retrieval, called MALS, and explore the feasibility of pre-training on attribute recognition and image-text matching simultaneously. In particular, MALS contains 1,510,330 image-text pairs, about 37.5 times more than the prevailing CUHK-PEDES dataset, and all images are annotated with 27 attributes. Considering privacy concerns and annotation costs, we leverage off-the-shelf diffusion models to generate the dataset. To verify the feasibility of learning from the generated data, we develop a new joint Attribute Prompt Learning and Text Matching Learning (APTM) framework that exploits the knowledge shared between attributes and text. As the name implies, APTM contains an attribute prompt learning stream and a text matching learning stream. (1) The attribute prompt learning stream leverages attribute prompts for image-attribute alignment, which enhances text matching learning. (2) The text matching learning stream facilitates representation learning of fine-grained details and, in turn, boosts attribute prompt learning. Extensive experiments validate the effectiveness of pre-training on MALS: APTM achieves state-of-the-art retrieval performance on three challenging real-world benchmarks, with consistent Recall@1 improvements of +6.96%, +7.68%, and +16.95% on CUHK-PEDES, ICFG-PEDES, and RSTPReid, respectively.
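To make the two streams concrete, below is a minimal training-step sketch in PyTorch. It is an illustration under assumptions, not the authors' released APTM implementation (which contains additional components): `image_encoder`, `text_encoder`, the `contrastive_loss` helper, and the loss weight `alpha` are hypothetical names introduced here, and the attribute prompts are templated sentences built from the 27 MALS attributes.

```python
import torch
import torch.nn.functional as F


def contrastive_loss(image_emb, text_emb, temperature=0.07):
    # Symmetric InfoNCE over L2-normalized embeddings (illustrative helper,
    # standing in for the paper's image-text matching objective).
    image_emb = F.normalize(image_emb, dim=-1)
    text_emb = F.normalize(text_emb, dim=-1)
    logits = image_emb @ text_emb.t() / temperature
    targets = torch.arange(logits.size(0), device=logits.device)
    return (F.cross_entropy(logits, targets) +
            F.cross_entropy(logits.t(), targets)) / 2


def aptm_step(image_encoder, text_encoder, images, captions,
              attribute_prompts, attribute_labels, alpha=0.5):
    """One hypothetical joint training step for the two APTM streams.

    attribute_prompts: list of templated sentences, e.g. "a pedestrian wearing a backpack"
                       (encoders are assumed to handle tokenization internally).
    attribute_labels:  (batch, num_attributes) binary tensor from MALS annotations.
    """
    img_emb = image_encoder(images)              # (B, D)
    txt_emb = text_encoder(captions)             # (B, D)
    attr_emb = text_encoder(attribute_prompts)   # (A, D)

    # Text matching stream: align each image with its paired caption.
    loss_itm = contrastive_loss(img_emb, txt_emb)

    # Attribute prompt stream: score every image against every attribute prompt
    # and supervise with the binary attribute annotations (image-attribute alignment).
    attr_logits = F.normalize(img_emb, dim=-1) @ F.normalize(attr_emb, dim=-1).t()  # (B, A)
    loss_apl = F.binary_cross_entropy_with_logits(attr_logits, attribute_labels.float())

    # The two streams share the same encoders, so each loss regularizes the other.
    return loss_itm + alpha * loss_apl
```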

| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Text-based Person Retrieval | CUHK-PEDES | APTM | R@1 | 76.53 | #1 |
| Text-based Person Retrieval | CUHK-PEDES | APTM | R@5 | 90.04 | #3 |
| Text-based Person Retrieval | CUHK-PEDES | APTM | R@10 | 94.15 | #2 |
| Text-based Person Retrieval | CUHK-PEDES | APTM | mAP | 66.91 | #5 |
| Text-based Person Retrieval | ICFG-PEDES | APTM | R@1 | 68.51 | #1 |
| Text-based Person Retrieval | ICFG-PEDES | APTM | mAP | 41.22 | #2 |
| Text-based Person Retrieval | RSTPReid | APTM | R@1 | 67.50 | #1 |
| Text-based Person Retrieval | RSTPReid | APTM | R@5 | 85.70 | #3 |
| Text-based Person Retrieval | RSTPReid | APTM | R@10 | 91.45 | #1 |
