Dimensionality-Driven Learning with Noisy Labels

Datasets with significant proportions of noisy (incorrect) class labels present challenges for training accurate Deep Neural Networks (DNNs). We propose a new perspective for understanding DNN generalization for such datasets, by investigating the dimensionality of the deep representation subspace of training samples. We show that from a dimensionality perspective, DNNs exhibit quite distinctive learning styles when trained with clean labels versus when trained with a proportion of noisy labels. Based on this finding, we develop a new dimensionality-driven learning strategy, which monitors the dimensionality of subspaces during training and adapts the loss function accordingly. We empirically demonstrate that our approach is highly tolerant to significant proportions of noisy labels, and can effectively learn low-dimensional local subspaces that capture the data distribution.

PDF Abstract ICML 2018 PDF ICML 2018 Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Image Classification Clothing1M D2L Accuracy 69.47% # 49

Results from Other Papers


Task Dataset Model Metric Name Metric Value Rank Source Paper Compare
Image Classification mini WebVision 1.0 D2L (Inception-ResNet-v2) Top-1 Accuracy 62.68 # 39
Top-5 Accuracy 84.00 # 30
ImageNet Top-1 Accuracy 57.80 # 35
ImageNet Top-5 Accuracy 81.36 # 32

Methods


No methods listed for this paper. Add relevant methods here