TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Low-Light Image Enhancement	LOL	DA-CLIP	Average PSNR	23.77	# 15
Low-Light Image Enhancement	LOL	DA-CLIP	SSIM	0.830	# 19
Low-Light Image Enhancement	LOL	DA-CLIP	LPIPS	0.083	# 3
Single Image Deraining	Rain100H	DA-CLIP	PSNR	33.91	# 1
Single Image Deraining	Rain100H	DA-CLIP	SSIM	0.926	# 1
Image Dehazing	RESIDE-6K	DA-CLIP	PSNR	30.16	# 3
Image Dehazing	RESIDE-6K	DA-CLIP	SSIM	0.936	# 4
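
For readers unfamiliar with the PSNR columns above, the following is a minimal sketch of the standard peak signal-to-noise ratio definition for 8-bit pixel values. It is not taken from the paper's evaluation code: the function name `psnr`, the flat-list input format, and the `max_value` default are assumptions for illustration, and benchmark scripts may differ in details such as color space or cropping.

```python
import math

def psnr(reference, restored, max_value=255.0):
    """Peak signal-to-noise ratio (in dB) between two equal-length pixel sequences.

    Hypothetical helper: images are passed as flat lists of pixel intensities.
    """
    if len(reference) != len(restored) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error between the reference and the restored pixels.
    mse = sum((r - x) ** 2 for r, x in zip(reference, restored)) / len(reference)
    if mse == 0:
        return math.inf  # identical images: PSNR is unbounded
    return 10.0 * math.log10(max_value ** 2 / mse)

# Toy usage with made-up pixel values (not data from the benchmarks).
clean = [100, 150, 200, 250]
output = [101, 149, 202, 248]
score = psnr(clean, output)
```

Higher PSNR means the restored image is closer to the reference, which is why a higher value corresponds to a better rank in the table.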

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/controlling-vision-language-models-for/single-image-deraining-on-rain100h)](https://paperswithcode.com/sota/single-image-deraining-on-rain100h?p=controlling-vision-language-models-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/controlling-vision-language-models-for/image-dehazing-on-reside-6k)](https://paperswithcode.com/sota/image-dehazing-on-reside-6k?p=controlling-vision-language-models-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/controlling-vision-language-models-for/low-light-image-enhancement-on-lol)](https://paperswithcode.com/sota/low-light-image-enhancement-on-lol?p=controlling-vision-language-models-for)`

Controlling Vision-Language Models for Multi-Task Image Restoration

2 Oct 2023 · Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön

Vision-language models such as CLIP have shown great impact on diverse downstream tasks for zero-shot or label-free predictions. However, when it comes to low-level vision tasks such as image restoration, their performance deteriorates dramatically due to corrupted inputs. In this paper, we present a degradation-aware vision-language model (DA-CLIP) to better transfer pretrained vision-language models to low-level vision tasks as a multi-task framework for image restoration. More specifically, DA-CLIP trains an additional controller that adapts the fixed CLIP image encoder to predict high-quality feature embeddings. By integrating the embedding into an image restoration network via cross-attention, we are able to pilot the model to learn a high-fidelity image reconstruction. The controller itself also outputs a degradation feature that matches the real corruptions of the input, yielding a natural classifier for different degradation types. In addition, we construct a mixed degradation dataset with synthetic captions for DA-CLIP training. Our approach advances state-of-the-art performance on both degradation-specific and unified image restoration tasks, showing a promising direction of prompting image restoration with large-scale pretrained vision-language models. Our code is available at https://github.com/Algolzw/daclip-uir.
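
The abstract notes that the controller's degradation feature yields a natural classifier for degradation types. A minimal sketch of how such a classifier could work in the usual CLIP style follows, assuming the degradation feature is compared against text embeddings of degradation names by cosine similarity. The function names, the temperature value, and the three-dimensional toy vectors are all hypothetical; a real model would produce these embeddings with its image controller and text encoder, and this is not the authors' implementation.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def classify_degradation(degradation_feature, text_features, temperature=0.01):
    """Softmax over scaled cosine similarities to each degradation-name embedding.

    `temperature` is an assumed scaling factor, not a value from the paper.
    """
    names = list(text_features)
    logits = [cosine(degradation_feature, text_features[n]) / temperature
              for n in names]
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(l - peak) for l in logits]
    total = sum(exps)
    return {n: e / total for n, e in zip(names, exps)}

# Toy embeddings (made up): one vector per degradation type.
text_features = {
    "hazy": [1.0, 0.0, 0.0],
    "rainy": [0.0, 1.0, 0.0],
    "low-light": [0.0, 0.0, 1.0],
}
probs = classify_degradation([0.1, 0.9, 0.2], text_features)
predicted = max(probs, key=probs.get)
```

In this toy setup the input feature lies closest to the "rainy" embedding, so that label receives the highest probability.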

Code

algolzw/daclip-uir official

532 stars

Tasks

Image Dehazing

Image Denoising

Image Inpainting

Image Reconstruction

Image Restoration

JPEG Artifact Removal

Language Modelling

Low-Light Image Enhancement

Rain Removal

Shadow Removal

Single Image Deraining

Unified Image Restoration

Datasets

GoPro

LOL

Raindrop

Synthetic Rain Datasets

Results from the Paper

Ranked #1 on Single Image Deraining on Rain100H

Methods

CLIP • Diffusion
