DivideMix: Learning with Noisy Labels as Semi-supervised Learning

ICLR 2020  ·  Junnan Li, Richard Socher, Steven C. H. Hoi ·

Deep neural networks are known to be annotation-hungry. Numerous efforts have been devoted to reducing the annotation cost when learning with deep networks. Two prominent directions include learning with noisy labels and semi-supervised learning by exploiting unlabeled data. In this work, we propose DivideMix, a novel framework for learning with noisy labels by leveraging semi-supervised learning techniques. In particular, DivideMix models the per-sample loss distribution with a mixture model to dynamically divide the training data into a labeled set with clean samples and an unlabeled set with noisy samples, and trains the model on both the labeled and unlabeled data in a semi-supervised manner. To avoid confirmation bias, we simultaneously train two diverged networks where each network uses the dataset division from the other network. During the semi-supervised training phase, we improve the MixMatch strategy by performing label co-refinement and label co-guessing on labeled and unlabeled samples, respectively. Experiments on multiple benchmark datasets demonstrate substantial improvements over state-of-the-art methods. Code is available at https://github.com/LiJunnan1992/DivideMix .

PDF Abstract ICLR 2020 PDF ICLR 2020 Abstract
Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Learning with noisy labels CIFAR-100N Divide-Mix Accuracy (mean) 71.13 # 3
Learning with noisy labels CIFAR-10N-Aggregate Divide-Mix Accuracy (mean) 95.01 # 6
Learning with noisy labels CIFAR-10N-Random1 Divide-Mix Accuracy (mean) 90.18 # 13
Learning with noisy labels CIFAR-10N-Random2 Divide-Mix Accuracy (mean) 90.90 # 7
Learning with noisy labels CIFAR-10N-Random3 Divide-Mix Accuracy (mean) 89.97 # 11
Learning with noisy labels CIFAR-10N-Worst Divide-Mix Accuracy (mean) 92.56 # 5
Image Classification Clothing1M DivideMix Accuracy 74.76% # 14
Image Classification mini WebVision 1.0 DivideMix (Inception-ResNet-v2) Top-1 Accuracy 77.32 # 29
Top-5 Accuracy 91.64 # 21
ImageNet Top-1 Accuracy 75.20 # 18
ImageNet Top-5 Accuracy 91.64 # 19
Image Classification mini WebVision 1.0 DivideMix (ResNet-50) Top-1 Accuracy 76.32 ±0.36 # 31
Top-5 Accuracy 90.65 ±0.16 # 24
ImageNet Top-1 Accuracy 74.42 ±0.29 # 22
ImageNet Top-5 Accuracy 91.21 ±0.12 # 22
Image Classification mini WebVision 1.0 DivideMix (ResNet-18) Top-1 Accuracy 76.08 # 32

Methods


No methods listed for this paper. Add relevant methods here