A Light CNN for Deep Face Representation with Noisy Labels

9 Nov 2015  ·  Xiang Wu, Ran He, Zhenan Sun, Tieniu Tan

The volume of convolutional neural network (CNN) models proposed for face recognition has been growing continuously to better fit large amounts of training data. When training data are obtained from the internet, the labels are likely to be ambiguous and inaccurate. This paper presents a Light CNN framework to learn a compact embedding on large-scale face data with massive noisy labels. First, we introduce a variation of maxout activation, called Max-Feature-Map (MFM), into each convolutional layer of the CNN. Unlike maxout activation, which uses many feature maps to linearly approximate an arbitrary convex activation function, MFM does so via a competitive relationship. MFM can not only separate noisy and informative signals but also play the role of feature selection between two feature maps. Second, three networks are carefully designed to obtain better performance while reducing the number of parameters and the computational cost. Lastly, a semantic bootstrapping method is proposed to make the predictions of the networks more consistent with noisy labels. Experimental results show that the proposed framework can utilize large-scale noisy data to learn a light model that is efficient in computational cost and storage space. The learned single network with a 256-D representation achieves state-of-the-art results on various face benchmarks without fine-tuning. The code is released at https://github.com/AlfredXiangWu/LightCNN.
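The competitive relationship behind MFM can be illustrated with a minimal NumPy sketch. It assumes the common two-way form of the operation, in which the input feature maps are split into two halves along the channel axis and the element-wise maximum of the halves is kept; the abstract does not state the formula, so this is an illustration and not the authors' implementation. The function name `max_feature_map` and the toy input are hypothetical.

```python
import numpy as np


def max_feature_map(x: np.ndarray) -> np.ndarray:
    """Two-way Max-Feature-Map sketch (assumed form, not the released code).

    x has shape (channels, height, width) with an even channel count.
    The result has shape (channels // 2, height, width).
    """
    channels = x.shape[0]
    if channels % 2 != 0:
        raise ValueError("MFM needs an even number of input feature maps")
    # Split the channels into two halves and let them compete element-wise:
    # at every position only the larger of the two responses survives.
    first_half, second_half = np.split(x, 2, axis=0)
    return np.maximum(first_half, second_half)


# Toy input: 4 feature maps of size 2x2 (values chosen for illustration only).
x = np.array([
    [[1.0, -2.0], [0.5, 4.0]],
    [[3.0, 0.0], [-1.0, 2.0]],
    [[-1.0, 5.0], [0.0, 0.0]],
    [[2.0, 1.0], [7.0, -3.0]],
])
y = max_feature_map(x)
print(y.shape)  # half as many feature maps as the input
```

Under this assumption the output has half as many channels as the input, which is one way such an activation can act as feature selection between two feature maps while keeping the representation compact.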

Task                            Dataset           Model         Metric Name  Metric Value  Global Rank
Age-Invariant Face Recognition  CACDVS            MFM-CNN       Accuracy     97.95%        # 8
Age-Invariant Face Recognition  CAFR              Light CNN     Accuracy     73.56%        # 2
Face Identification             MegaFace          Light CNN-29  Accuracy     73.749%       # 11
Face Verification               MegaFace          Light CNN-29  Accuracy     85.133%       # 12
Face Verification               YouTube Faces DB  Light CNN-29  Accuracy     95.54%        # 7

Methods