TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Text-based Image Editing	PIE-Bench	Virtual Inversion+Unified Attention Control+LCM	CLIPSIM	25.03	# 3
Text-based Image Editing	PIE-Bench	Virtual Inversion+Unified Attention Control+LCM	Structure Distance	13.78	# 4
Text-based Image Editing	PIE-Bench	Virtual Inversion+Unified Attention Control+LCM	Background PSNR	28.51	# 1
Text-based Image Editing	PIE-Bench	Virtual Inversion+Unified Attention Control+LCM	Background LPIPS	47.58	# 1
Text-based Image Editing	PIE-Bench	Virtual Inversion+Prompt-to-Prompt	CLIPSIM	24.89	# 6
Text-based Image Editing	PIE-Bench	Virtual Inversion+Prompt-to-Prompt	Structure Distance	14.22	# 5
Text-based Image Editing	PIE-Bench	Virtual Inversion+Prompt-to-Prompt	Background PSNR	27.52	# 2
Text-based Image Editing	PIE-Bench	Virtual Inversion+Prompt-to-Prompt	Background LPIPS	47.98	# 2
Text-based Image Editing	PIE-Bench	Virtual Inversion+Prompt-to-Prompt+LCM	CLIPSIM	24.57	# 10
Text-based Image Editing	PIE-Bench	Virtual Inversion+Prompt-to-Prompt+LCM	Structure Distance	15.61	# 6
Text-based Image Editing	PIE-Bench	Virtual Inversion+Prompt-to-Prompt+LCM	Background PSNR	26.64	# 5
Text-based Image Editing	PIE-Bench	Virtual Inversion+Prompt-to-Prompt+LCM	Background LPIPS	55.85	# 4

Inversion-Free Image Editing with Natural Language

7 Dec 2023 · Sihan Xu, Yidong Huang, Jiayi Pan, Ziqiao Ma, Joyce Chai

Despite recent advances in inversion-based editing, text-guided image manipulation remains challenging for diffusion models. The primary bottlenecks include 1) the time-consuming nature of the inversion process; 2) the struggle to balance consistency with accuracy; and 3) the lack of compatibility with efficient consistency sampling methods used in consistency models. To address the above issues, we start by asking whether the inversion process can be eliminated for editing. We show that when the initial sample is known, a special variance schedule reduces the denoising step to the same form as multi-step consistency sampling. We name this the Denoising Diffusion Consistent Model (DDCM), and note that it implies a virtual inversion strategy without explicit inversion in sampling. We further unify the attention control mechanisms in a tuning-free framework for text-guided editing. Combining them, we present inversion-free editing (InfEdit), which allows for consistent and faithful editing for both rigid and non-rigid semantic changes, catering to intricate modifications without compromising the image's integrity or requiring explicit inversion. Through extensive experiments, InfEdit shows strong performance in various editing tasks and also maintains a seamless workflow (less than 3 seconds on a single A40), demonstrating the potential for real-time applications. Project Page: https://sled-group.github.io/InfEdit/
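
The claim that a particular variance schedule turns the denoising step into a consistency-style step can be checked numerically. The sketch below is an illustration under stated assumptions, not the InfEdit implementation: it assumes the standard generalized DDIM update (which the abstract does not spell out), takes the "special" variance to be sigma_t^2 = 1 - alpha_{t-1}, and uses hypothetical function names. With that variance the deterministic direction term vanishes, so the step becomes "predict x0, then re-noise", and when x0 is known the noise can be chosen so that the prediction returns x0 exactly.

```python
import numpy as np


def generalized_ddim_step(x_t, eps_pred, alpha_t, alpha_prev, sigma, noise):
    """One generalized DDIM step (assumed form; alphas are cumulative products)."""
    # Estimate of the clean sample implied by the predicted noise.
    x0_pred = (x_t - np.sqrt(1.0 - alpha_t) * eps_pred) / np.sqrt(alpha_t)
    # Deterministic "direction pointing to x_t" term; clamp guards float round-off.
    direction = np.sqrt(max(1.0 - alpha_prev - sigma ** 2, 0.0)) * eps_pred
    x_prev = np.sqrt(alpha_prev) * x0_pred + direction + sigma * noise
    return x_prev, x0_pred


def consistency_style_step(x0_pred, alpha_prev, noise):
    """Multi-step consistency sampling form: re-noise the predicted clean sample."""
    return np.sqrt(alpha_prev) * x0_pred + np.sqrt(1.0 - alpha_prev) * noise


def consistent_noise(x_t, x0, alpha_t):
    """Noise for which the x0 estimate equals the known x0 (hypothetical helper)."""
    return (x_t - np.sqrt(alpha_t) * x0) / np.sqrt(1.0 - alpha_t)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    alpha_t, alpha_prev = 0.5, 0.8            # illustrative schedule values
    x0 = rng.normal(size=(4, 4))              # the known initial sample
    x_t = np.sqrt(alpha_t) * x0 + np.sqrt(1.0 - alpha_t) * rng.normal(size=(4, 4))
    eps = consistent_noise(x_t, x0, alpha_t)
    noise = rng.normal(size=(4, 4))
    sigma = np.sqrt(1.0 - alpha_prev)         # assumed special variance choice
    x_prev, x0_pred = generalized_ddim_step(x_t, eps, alpha_t, alpha_prev, sigma, noise)
    print(np.allclose(x0_pred, x0, atol=1e-6))
    print(np.allclose(x_prev, consistency_style_step(x0, alpha_prev, noise), atol=1e-6))
```

Because no latent has to be recovered by running the sampler backwards, a noisy x_t can be drawn directly from the known x0, which is one way to read the abstract's "virtual inversion".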

Code

sled-group/InfEdit official

Tasks

Image Manipulation

Text-based Image Editing

Datasets

PIE-Bench

Results from the Paper

Ranked #1 on Text-based Image Editing on PIE-Bench

Methods

Diffusion
