TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Domain Adaptation	ImageCLEF-DA	CMKD	Accuracy	94.3	# 1
Domain Adaptation	Office-31	CMKD	Average Accuracy	94.4	# 2
Domain Adaptation	Office-Home	CMKD	Accuracy	89.0	# 2
Domain Adaptation	VisDA2017	CMKD	Accuracy	91.8	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/unsupervised-domain-adaption-harnessing/domain-adaptation-on-imageclef-da)](https://paperswithcode.com/sota/domain-adaptation-on-imageclef-da?p=unsupervised-domain-adaption-harnessing)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/unsupervised-domain-adaption-harnessing/domain-adaptation-on-office-31)](https://paperswithcode.com/sota/domain-adaptation-on-office-31?p=unsupervised-domain-adaption-harnessing)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/unsupervised-domain-adaption-harnessing/domain-adaptation-on-office-home)](https://paperswithcode.com/sota/domain-adaptation-on-office-home?p=unsupervised-domain-adaption-harnessing)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/unsupervised-domain-adaption-harnessing/domain-adaptation-on-visda2017)](https://paperswithcode.com/sota/domain-adaptation-on-visda2017?p=unsupervised-domain-adaption-harnessing)`

Unsupervised Domain Adaption Harnessing Vision-Language Pre-training

journal 2024 · Wenlve Zhou and Zhiheng Zhou

This paper addresses two challenges in Unsupervised Domain Adaptation (UDA), with a focus on harnessing Vision-Language Pre-training (VLP) models. First, UDA has primarily relied on ImageNet pre-trained models, and the potential of VLP models in UDA remains largely unexplored, even though their rich representations hold significant promise for UDA tasks. To address this, we propose Cross-Modal Knowledge Distillation (CMKD), which uses VLP models as teachers to guide learning in the target domain and achieves state-of-the-art performance. Second, current UDA paradigms train a separate model for each task, which leads to significant storage overhead and makes deployment impractical as the number of transfer tasks grows. To overcome this, we introduce Residual Sparse Training (RST), a technique that exploits VLP's extensive pre-training and requires adjusting only a small fraction (approximately 0.1% to 0.5%) of the VLP model parameters to achieve performance comparable to fine-tuning. Combining CMKD and RST, we present a comprehensive solution that leverages VLP models for UDA while reducing the storage overhead of model deployment. Furthermore, CMKD can serve as a baseline in conjunction with other methods such as FixMatch, further improving UDA performance. Our proposed method outperforms existing techniques on standard benchmarks. Our code will be available at: https://github.com/Wenlve-Zhou/VLP-UDA.
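The storage argument in the abstract can be illustrated with a toy sketch. This is not the authors' RST implementation: it assumes, purely for illustration, that the task-specific change is kept as a thresholded difference between fine-tuned and pre-trained weights. The function names, the threshold, and the weight values are all hypothetical.

```python
def sparse_residual(pretrained, finetuned, threshold):
    """Keep only the weight changes whose magnitude exceeds `threshold`.

    Returns a dict mapping parameter index -> change (hypothetical storage format).
    """
    return {
        i: f - p
        for i, (p, f) in enumerate(zip(pretrained, finetuned))
        if abs(f - p) > threshold
    }


def apply_residual(pretrained, residual):
    """Rebuild task weights from the shared base plus a stored sparse residual."""
    restored = list(pretrained)
    for i, delta in residual.items():
        restored[i] += delta
    return restored


# Toy weights (made up): one shared pre-trained vector, one fine-tuned copy.
pretrained = [0.50, -1.20, 0.03, 2.00, -0.75, 0.10]
finetuned = [0.50, -1.19, 0.43, 2.00, -0.75, -0.40]

residual = sparse_residual(pretrained, finetuned, threshold=0.05)
restored = apply_residual(pretrained, residual)

# Fraction of parameters that must be stored per task in this toy setting.
stored_fraction = len(residual) / len(pretrained)
```

Under this assumption, each additional transfer task costs only the sparse residual, while the pre-trained weights are stored once and shared. Changes below the threshold are discarded, so the restored weights only approximate the fine-tuned ones.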

Code

Wenlve-Zhou/VLP-UDA official

Tasks

Domain Adaptation

Unsupervised Domain Adaptation

Datasets

Office-Home

Office-31

VisDA-2017

ImageCLEF-DA

Results from the Paper

Ranked #1 on Domain Adaptation on ImageCLEF-DA

Methods

FixMatch • Focus • Knowledge Distillation
