Clustering-friendly Representation Learning via Instance Discrimination and Feature Decorrelation

ICLR 2021  ·  Yaling Tao, Kentaro Takagi, Kouta Nakata ·

Clustering is one of the most fundamental tasks in machine learning. Recently, deep clustering has become a major trend in clustering techniques. Representation learning often plays an important role in the effectiveness of deep clustering, and thus can be a principal cause of performance degradation. In this paper, we propose a clustering-friendly representation learning method using instance discrimination and feature decorrelation. Our deep-learning-based representation learning method is motivated by the properties of classical spectral clustering. Instance discrimination learns similarities among data and feature decorrelation removes redundant correlation among features. We utilize an instance discrimination method in which learning individual instance classes leads to learning similarity among instances. Through detailed experiments and examination, we show that the approach can be adapted to learning a latent space for clustering. We design novel softmax-formulated decorrelation constraints for learning. In evaluations of image clustering using CIFAR-10 and ImageNet-10, our method achieves accuracy of 81.5% and 95.4%, respectively. We also show that the softmax-formulated constraints are compatible with various neural networks.

PDF Abstract ICLR 2021 PDF ICLR 2021 Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Image Clustering CIFAR-10 IDFD Accuracy 0.815 # 17
NMI 0.711 # 15
Train set Train+Test # 1
ARI 0.663 # 16
Backbone ResNet-18 # 1
Image Clustering CIFAR-100 IDFD Accuracy 0.425 # 15
NMI 0.426 # 13
Train Set Train # 1
ARI 0.264 # 14
Image Clustering ImageNet-10 IDFD Accuracy 0.954 # 5
NMI 0.898 # 6
ARI 0.901 # 5
Image Size 96 # 3
Image Clustering Imagenet-dog-15 IDFD Accuracy 0.591 # 8
NMI 0.546 # 8
ARI 0.413 # 8
Image Size 96 # 5
Image Clustering STL-10 IDFD Accuracy 0.756 # 12
NMI 0.643 # 10
Train Split Train+Test # 1
Backbone ResNet-18 # 1

Methods


No methods listed for this paper. Add relevant methods here