TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Metric Learning	CARS196	EfficientDML-VPTSP-G/512	R@1	91.2	# 6
Metric Learning	CUB-200-2011	EfficientDML-VPTSP-G/512	R@1	88.5	# 2
Image Retrieval	iNaturalist	EfficientDML-VPTSP-G/512	R@1	84.5	# 2
Metric Learning	In-Shop	EfficientDML-VPTSP-G/512	R@1	92.1	# 9

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-semantic-proxies-from-visual-prompts/metric-learning-on-cub-200-2011)](https://paperswithcode.com/sota/metric-learning-on-cub-200-2011?p=learning-semantic-proxies-from-visual-prompts)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-semantic-proxies-from-visual-prompts/image-retrieval-on-inaturalist)](https://paperswithcode.com/sota/image-retrieval-on-inaturalist?p=learning-semantic-proxies-from-visual-prompts)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-semantic-proxies-from-visual-prompts/metric-learning-on-cars196)](https://paperswithcode.com/sota/metric-learning-on-cars196?p=learning-semantic-proxies-from-visual-prompts)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-semantic-proxies-from-visual-prompts/metric-learning-on-in-shop-1)](https://paperswithcode.com/sota/metric-learning-on-in-shop-1?p=learning-semantic-proxies-from-visual-prompts)`

Learning Semantic Proxies from Visual Prompts for Parameter-Efficient Fine-Tuning in Deep Metric Learning

4 Feb 2024 · Li Ren, Chen Chen, Liqiang Wang, Kien Hua ·

Deep Metric Learning (DML) has long attracted the attention of the machine learning community as a key objective. Existing solutions concentrate on fine-tuning the pre-trained models on conventional image datasets. As a result of the success of recent pre-trained models trained from larger-scale datasets, it is challenging to adapt the model to the DML tasks in the local data domain while retaining the previously gained knowledge. In this paper, we investigate parameter-efficient methods for fine-tuning the pre-trained model for DML tasks. In particular, we propose a novel and effective framework based on learning Visual Prompts (VPT) in the pre-trained Vision Transformers (ViT). Based on the conventional proxy-based DML paradigm, we augment the proxy by incorporating the semantic information from the input image and the ViT, in which we optimize the visual prompts for each class. We demonstrate that our new approximations with semantic information are superior to representative capabilities, thereby improving metric learning performance. We conduct extensive experiments to demonstrate that our proposed framework is effective and efficient by evaluating popular DML benchmarks. In particular, we demonstrate that our fine-tuning method achieves comparable or even better performance than recent state-of-the-art full fine-tuning works of DML while tuning only a small percentage of total parameters.

PDF Abstract

Code

Add Remove Mark official

noahsark/parameterefficient-dml official

Tasks

Add Remove

Image Retrieval

Metric Learning

Datasets

ImageNet

CUB-200-2011

iNaturalist

Stanford Online Products

In-Shop CARS196

Results from the Paper

Add Remove

Ranked #2 on Image Retrieval on iNaturalist

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Metric Learning	CARS196	EfficientDML-VPTSP-G/512	R@1	91.2	# 6	Compare
Metric Learning	CUB-200-2011	EfficientDML-VPTSP-G/512	R@1	88.5	# 2	Compare
Image Retrieval	iNaturalist	EfficientDML-VPTSP-G/512	R@1	84.5	# 2	Compare
Metric Learning	In-Shop	EfficientDML-VPTSP-G/512	R@1	92.1	# 9	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Learning Semantic Proxies from Visual Prompts for Parameter-Efficient Fine-Tuning in Deep Metric Learning

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove