PixelGAN Autoencoders

NeurIPS 2017  ·  Alireza Makhzani, Brendan Frey ·

In this paper, we describe the "PixelGAN autoencoder", a generative autoencoder in which the generative path is a convolutional autoregressive neural network on pixels (PixelCNN) that is conditioned on a latent code, and the recognition path uses a generative adversarial network (GAN) to impose a prior distribution on the latent code. We show that different priors result in different decompositions of information between the latent code and the autoregressive decoder. For example, by imposing a Gaussian distribution as the prior, we can achieve a global vs. local decomposition, or by imposing a categorical distribution as the prior, we can disentangle the style and content information of images in an unsupervised fashion. We further show how the PixelGAN autoencoder with a categorical prior can be directly used in semi-supervised settings and achieve competitive semi-supervised classification results on the MNIST, SVHN and NORB datasets.

PDF Abstract NeurIPS 2017 PDF NeurIPS 2017 Abstract

Datasets


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Unsupervised MNIST MNIST PixelGAN Autoencoders Accuracy 94.73 # 10
Unsupervised Image Classification MNIST PixelGAN Autoencoders Accuracy 94.73 # 9

Methods