TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Text-based Image Editing	PIE-Bench	DDIM Inversion+Pix2Pix-Zero	CLIPSIM	22.80	# 14
Text-based Image Editing	PIE-Bench	DDIM Inversion+Pix2Pix-Zero	Structure Distance	61.68	# 13
Text-based Image Editing	PIE-Bench	DDIM Inversion+Pix2Pix-Zero	Background PSNR	20.44	# 13
Text-based Image Editing	PIE-Bench	DDIM Inversion+Pix2Pix-Zero	Background LPIPS	172.22	# 13

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/zero-shot-image-to-image-translation/text-based-image-editing-on-pie-bench)](https://paperswithcode.com/sota/text-based-image-editing-on-pie-bench?p=zero-shot-image-to-image-translation)`

Zero-shot Image-to-Image Translation

6 Feb 2023 · Gaurav Parmar, Krishna Kumar Singh, Richard Zhang, Yijun Li, Jingwan Lu, Jun-Yan Zhu ·

Large-scale text-to-image generative models have shown their remarkable ability to synthesize diverse and high-quality images. However, it is still challenging to directly apply these models for editing real images for two reasons. First, it is hard for users to come up with a perfect text prompt that accurately describes every visual detail in the input image. Second, while existing models can introduce desirable changes in certain regions, they often dramatically alter the input content and introduce unexpected changes in unwanted regions. In this work, we propose pix2pix-zero, an image-to-image translation method that can preserve the content of the original image without manual prompting. We first automatically discover editing directions that reflect desired edits in the text embedding space. To preserve the general content structure after editing, we further propose cross-attention guidance, which aims to retain the cross-attention maps of the input image throughout the diffusion process. In addition, our method does not need additional training for these edits and can directly use the existing pre-trained text-to-image diffusion model. We conduct extensive experiments and show that our method outperforms existing and concurrent works for both real and synthetic image editing.

PDF Abstract

Code

Add Remove Mark official

pix2pixzero/pix2pix-zero official

↳ Quickstart in

Spaces

1,001

hansam95/nmg

Tasks

Add Remove

Image-to-Image Translation

Text-based Image Editing

Translation

Datasets

PIE-Bench

Results from the Paper

Edit

Ranked #13 on Text-based Image Editing on PIE-Bench

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Text-based Image Editing	PIE-Bench	DDIM Inversion+Pix2Pix-Zero	CLIPSIM	22.80	# 14	Compare
			Structure Distance	61.68	# 13	Compare
			Background PSNR	20.44	# 13	Compare
			Background LPIPS	172.22	# 13	Compare

Methods

Add Remove

Diffusion

Edit Social Preview

Zero-shot Image-to-Image Translation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove