GENIE: Higher-Order Denoising Diffusion Solvers

11 Oct 2022 · Tim Dockhorn, Arash Vahdat, Karsten Kreis

Denoising diffusion models (DDMs) have emerged as a powerful class of generative models. A forward diffusion process slowly perturbs the data, while a deep model learns to gradually denoise. Synthesis amounts to solving a differential equation (DE) defined by the learnt model. Solving the DE requires slow iterative solvers for high-quality generation. In this work, we propose Higher-Order Denoising Diffusion Solvers (GENIE): Based on truncated Taylor methods, we derive a novel higher-order solver that significantly accelerates synthesis. Our solver relies on higher-order gradients of the perturbed data distribution, that is, higher-order score functions. In practice, only Jacobian-vector products (JVPs) are required and we propose to extract them from the first-order score network via automatic differentiation. We then distill the JVPs into a separate neural network that allows us to efficiently compute the necessary higher-order terms for our novel sampler during synthesis. We only need to train a small additional head on top of the first-order score network. We validate GENIE on multiple image generation benchmarks and demonstrate that GENIE outperforms all previous solvers. Unlike recent methods that fundamentally alter the generation process in DDMs, our GENIE solves the true generative DE and still enables applications such as encoding and guided sampling. Project page and code: https://nv-tlabs.github.io/GENIE.
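
To make the idea of a truncated Taylor method concrete, here is a minimal sketch on a toy scalar ODE, dx/dt = f(x, t) = -t·x. It is not the GENIE sampler or the diffusion ODE from the paper. The second-order step adds the term (h²/2)·df/dt, where the total derivative df/dt is a Jacobian-vector product of f along the tangent (f, 1). All names here (`f`, `jvp_fd`, `taylor2_step`, `integrate`) are hypothetical, and the central-difference JVP is only a stand-in for the automatic differentiation described in the abstract.

```python
import math


def f(x, t):
    """Toy right-hand side dx/dt = -t * x (illustrative; not a score network)."""
    return -t * x


def jvp_fd(func, x, t, vx, vt, eps=1e-4):
    """Directional derivative of func at (x, t) along (vx, vt).

    Uses central differences as a stand-in for an autodiff JVP.
    """
    forward = func(x + eps * vx, t + eps * vt)
    backward = func(x - eps * vx, t - eps * vt)
    return (forward - backward) / (2.0 * eps)


def euler_step(x, t, h):
    """First-order step: x + h * f."""
    return x + h * f(x, t)


def taylor2_step(x, t, h):
    """Second-order truncated Taylor step: x + h * f + (h^2 / 2) * df/dt."""
    fx = f(x, t)
    # Total derivative of f along the trajectory: JVP with tangent (dx/dt, dt/dt) = (f, 1).
    dfdt = jvp_fd(f, x, t, fx, 1.0)
    return x + h * fx + 0.5 * h * h * dfdt


def integrate(step, x0, t0, t1, n_steps):
    """Apply `step` n_steps times on a uniform grid from t0 to t1."""
    h = (t1 - t0) / n_steps
    x = x0
    for i in range(n_steps):
        x = step(x, t0 + i * h, h)
    return x


# Exact solution of dx/dt = -t * x with x(0) = 1 is exp(-t^2 / 2).
exact = math.exp(-0.5)
err_euler = abs(integrate(euler_step, 1.0, 0.0, 1.0, 10) - exact)
err_taylor2 = abs(integrate(taylor2_step, 1.0, 0.0, 1.0, 10) - exact)
print(f"Euler error:    {err_euler:.2e}")
print(f"Taylor-2 error: {err_taylor2:.2e}")
```

With the same number of steps, the second-order update tracks the exact solution more closely than Euler on this toy problem. In this sketch the extra term costs two more evaluations of `f` per step. The abstract describes avoiding that kind of overhead by distilling the JVPs into a small additional head on the first-order score network.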

Results from the Paper


| Task | Dataset | Model | Metric | Value | Global Rank |
| Image Generation | AFHQV2 | GENIE (NFEs=15) | FID | 4.83 | #5 |
| Image Generation | AFHQV2 | GENIE (NFEs=10) | FID | 4.9 | #6 |
| Image Generation | AFHQV2 | GENIE (NFEs=5) | FID | 5.53 | #7 |
| Image Generation | CIFAR-10 | GENIE (NFEs=10) | FID | 5.97 | #75 |
| Image Generation | CIFAR-10 | GENIE (Learned Sampler, NFEs=10) | FID | 5.28 | #72 |
| Image Generation | CIFAR-10 | GENIE (NFEs=15) | FID | 4.49 | #65 |
| Image Generation | CIFAR-10 | GENIE (Learned Sampler, NFEs=15) | FID | 4.49 | #65 |
| Image Generation | CIFAR-10 | GENIE (NFEs=20) | FID | 3.94 | #58 |
| Image Generation | CIFAR-10 | GENIE (Learned Sampler, NFEs=20) | FID | 3.94 | #58 |
| Image Generation | CIFAR-10 | GENIE (NFEs=25) | FID | 3.67 | #55 |
| Image Generation | CIFAR-10 | GENIE (Learned Sampler, NFEs=25) | FID | 3.64 | #53 |
| Image Generation | CIFAR-10 | GENIE (NFEs=5) | FID | 13.9 | #107 |
| Image Generation | CIFAR-10 | GENIE (Learned Sampler, NFEs=5) | FID | 11.2 | #94 |

Methods