TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Image Generation	CelebA 64x64	LadaGAN	FID	1.81	# 6
Image Generation	FFHQ 128 x 128	LadaGAN	FID	4.48	# 3
Image Generation	LSUN Bedroom 128 x 128	LadaGAN	FID	4.90	# 1

Efficient generative adversarial networks using linear additive-attention Transformers

17 Jan 2024 · Emilio Morales-Juarez, Gibran Fuentes-Pineda

Although the capacity of deep generative models for image generation, such as Diffusion Models (DMs) and Generative Adversarial Networks (GANs), has dramatically improved in recent years, much of their success can be attributed to computationally expensive architectures. This has limited their adoption and use to research laboratories and companies with large resources, while significantly raising the carbon footprint for training, fine-tuning, and inference. In this work, we present LadaGAN, an efficient generative adversarial network that is built upon a novel Transformer block named Ladaformer. The main component of this block is a linear additive-attention mechanism that computes a single attention vector per head instead of the quadratic dot-product attention. We employ Ladaformer in both the generator and discriminator, which reduces the computational complexity and overcomes the training instabilities often associated with Transformer GANs. LadaGAN consistently outperforms existing convolutional and Transformer GANs on benchmark datasets at different resolutions while being significantly more efficient. Moreover, LadaGAN shows competitive performance compared to state-of-the-art multi-step generative models (e.g. DMs) using orders of magnitude less computational resources.
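The abstract's central idea, a single attention vector per head in place of a quadratic token-by-token attention map, can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function name `additive_attention`, the scoring vector `w_a`, and the final mixing step (element-wise products of the pooled vector with keys and values) are assumptions, loosely following Fastformer-style additive attention. What it aims to show is only that each head needs `n` attention weights rather than an `n x n` matrix, so cost grows linearly with the number of tokens.

```python
import numpy as np


def softmax(x, axis):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def additive_attention(x, w_q, w_k, w_v, w_a, num_heads):
    """Illustrative linear additive attention (hypothetical, not the paper's code).

    x:   (n_tokens, d_model) input tokens
    w_q, w_k, w_v: (d_model, d_model) projection matrices
    w_a: (num_heads, d_head) per-head scoring vectors (assumed form)
    Returns (output of shape (n_tokens, d_model), weights of shape (num_heads, n_tokens)).
    """
    n, d_model = x.shape
    d_head = d_model // num_heads

    def split(t):
        # (n, d_model) -> (heads, n, d_head)
        return t.reshape(n, num_heads, d_head).transpose(1, 0, 2)

    q, k, v = split(x @ w_q), split(x @ w_k), split(x @ w_v)

    # One scalar score per token and head: shape (heads, n), never (n, n).
    scores = np.einsum("hnd,hd->hn", q, w_a) / np.sqrt(d_head)
    alpha = softmax(scores, axis=-1)

    # Pool the queries into a single global vector per head: (heads, d_head).
    g = np.einsum("hn,hnd->hd", alpha, q)

    # Broadcast the global vector back to every token (assumed mixing step).
    out = g[:, None, :] * k * v

    # (heads, n, d_head) -> (n, d_model)
    return out.transpose(1, 0, 2).reshape(n, d_model), alpha


# Usage with random weights: 16 tokens, model width 8, 2 heads.
rng = np.random.default_rng(0)
tokens = rng.normal(size=(16, 8))
w_q, w_k, w_v = (rng.normal(size=(8, 8)) for _ in range(3))
w_a = rng.normal(size=(2, 4))
out, alpha = additive_attention(tokens, w_q, w_k, w_v, w_a, num_heads=2)
print(out.shape, alpha.shape)  # (16, 8) (2, 16)
```

Every intermediate array here has at most `n * d_model` entries, whereas dot-product attention would materialize an `n x n` score matrix per head.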

Code

milmor/ladagan (official)

Tasks

Generative Adversarial Network

Image Generation

Datasets

CIFAR-10

CelebA

FFHQ

LSUN

Results from the Paper

Ranked #1 on Image Generation on LSUN Bedroom 128 x 128

Methods

Absolute Position Encodings • Adam • BPE • Dense Connections • Diffusion • Dropout • Label Smoothing • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer
