Facial Expression Recognition using Residual Masking Network

Automatic facial expression recognition (FER) has gained much attention due to its applications in human-computer interaction. Among the approaches to improve FER tasks, this paper focuses on deep architecture with the attention mechanism. We propose a novel Masking Idea to boost the performance of CNN in facial expression task. It uses a segmentation network to refine feature maps, enabling the network to focus on relevant information to make correct decisions. In experiments, we combine the ubiquitous Deep Residual Network and Unet-like architecture to produce a Residual Masking Network. The proposed method holds state-of-the-art (SOTA) accuracy on the well-known FER2013 and private VEMO datasets.

PDF

Datasets


Results from the Paper


 Ranked #1 on Facial Expression Recognition (FER) on FER2013 (using extra training data)

     Get a GitHub badge
Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Benchmark
Facial Expression Recognition (FER) FER2013 Ensemble ResMaskingNet with 6 other CNNs Accuracy 76.82 # 1
Facial Expression Recognition (FER) FER2013 Residual Masking Network Accuracy 74.14 # 6

Methods