Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks

19 Nov 2015  ·  Alec Radford, Luke Metz, Soumith Chintala ·

In recent years, supervised learning with convolutional networks (CNNs) has seen huge adoption in computer vision applications. Comparatively, unsupervised learning with CNNs has received less attention. In this work we hope to help bridge the gap between the success of CNNs for supervised learning and unsupervised learning. We introduce a class of CNNs called deep convolutional generative adversarial networks (DCGANs), that have certain architectural constraints, and demonstrate that they are a strong candidate for unsupervised learning. Training on various image datasets, we show convincing evidence that our deep convolutional adversarial pair learns a hierarchy of representations from object parts to scenes in both the generator and discriminator. Additionally, we use the learned features for novel tasks - demonstrating their applicability as general image representations.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Image Classification CIFAR-10 DCGAN Percentage correct 82.8 # 210
Conditional Image Generation CIFAR-10 DCGAN Inception score 6.58 # 18
Image Clustering CIFAR-10 GAN Accuracy 0.315 # 26
NMI 0.265 # 22
Train set Train+Test # 1
ARI 0.176 # 22
Backbone GAN # 1
Image Classification CIFAR-10 1 Layer K-means Percentage correct 80.6 # 215
Image Classification SVHN TSVM Percentage error 66.55 # 57
Image Classification SVHN DCGAN Percentage error 22.48 # 50
Image Classification SVHN KNN Percentage error 77.93 # 58
Image Classification SVHN Supervised CNN Percentage error 28.87 # 52

Results from Other Papers


Task Dataset Model Metric Name Metric Value Rank Uses Extra
Training Data
Source Paper Compare
Image Clustering CIFAR-100 GAN Accuracy 0.151 # 22
NMI 0.120 # 18
Train Set Train+Test # 1
Image Clustering ImageNet-10 GAN Accuracy 0.346 # 14
NMI 0.225 # 14
Image Clustering Imagenet-dog-15 GAN Accuracy 0.174 # 17
NMI 0.121 # 16
Image Clustering STL-10 GAN Accuracy 0.298 # 22
NMI 0.210 # 19
Train Split Train+Test # 1
Image Clustering Tiny-ImageNet GAN Accuracy 0.041 # 9
NMI 0.135 # 9

Methods