Stable Cluster Discrimination for Deep Clustering

ICCV 2023  ยท  Qi Qian ยท

Deep clustering can optimize representations of instances (i.e., representation learning) and explore the inherent data distribution (i.e., clustering) simultaneously, which demonstrates a superior performance over conventional clustering methods with given features. However, the coupled objective implies a trivial solution that all instances collapse to the uniform features. To tackle the challenge, a two-stage training strategy is developed for decoupling, where it introduces an additional pre-training stage for representation learning and then fine-tunes the obtained model for clustering. Meanwhile, one-stage methods are developed mainly for representation learning rather than clustering, where various constraints for cluster assignments are designed to avoid collapsing explicitly. Despite the success of these methods, an appropriate learning objective tailored for deep clustering has not been investigated sufficiently. In this work, we first show that the prevalent discrimination task in supervised learning is unstable for one-stage clustering due to the lack of ground-truth labels and positive instances for certain clusters in each mini-batch. To mitigate the issue, a novel stable cluster discrimination (SeCu) task is proposed and a new hardness-aware clustering criterion can be obtained accordingly. Moreover, a global entropy constraint for cluster assignments is studied with efficient optimization. Extensive experiments are conducted on benchmark data sets and ImageNet. SeCu achieves state-of-the-art performance on all of them, which demonstrates the effectiveness of one-stage deep clustering. Code is available at \url{https://github.com/idstcv/SeCu}.

PDF Abstract ICCV 2023 PDF ICCV 2023 Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Unsupervised Image Classification CIFAR-10 SeCu Accuracy 93 # 1
Image Clustering CIFAR-10 SeCu Accuracy 0.93 # 2
NMI 0.861 # 3
Train set Train # 1
ARI 0.857 # 3
Backbone ResNet-18 # 1
Unsupervised Image Classification CIFAR-20 SeCu Accuracy 55.2 # 4
Image Clustering ImageNet CoKe NMI 76.2 # 8
Accuracy 47.6 # 9
ARI 35.6 # 6
Image Clustering ImageNet SeCu NMI 79.4 # 7
Accuracy 53.5 # 7
ARI 41.9 # 5
Image Clustering STL-10 SeCu Accuracy 0.836 # 8
NMI 0.733 # 6
Train Split Train # 1
ARI 0.693 # 5
Backbone ResNet-18 # 1

Methods


No methods listed for this paper. Add relevant methods here