4.6 Article

Local and correlation attention learning for subtle facial expression recognition

Journal

NEUROCOMPUTING
Volume 453, Issue -, Pages 742-753

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2020.07.120

Keywords

Facial expression recognition; Feature extraction; Neural network; Attention mechanism

Funding

  1. National Natural Science Found for Distinguished Young Scholars [61925112, 61825603]
  2. National Key R&D Program of China [2017YFB0502900]
  3. National Natural Science Foundation of China [61806193, 61702498, 61772510]
  4. Young TopNotch Talent Program of Chinese Academy of Sciences [QYZDBSSWJSC015]
  5. CAS Light of West China Program [XAB2017B26, XAB2017B15]
  6. State Key Program of National Natural Science of China [61632018]
  7. National Defense Science and Technology Innovation Special Zone Project

Ask authors/readers for more resources

A model based on attention mechanism is proposed in this paper to focus on salient local regions and their correlations for subtle facial expression recognition.
Subtle facial expression recognition (SFER) aims to classify facial expressions with very low intensity into corresponding human emotions. Subtle facial expression can be regarded as a special kind of facial expression, whose facial muscle movements are more difficult to capture. In the last decade, various methods have been developed for common facial expression recognition (FER). However, most of them failed to automatically find the most discriminative parts of facial expression and the correlation of muscle movements when human makes facial expression, which makes them unsuitable for SFER. To better solve SFER problem, an attention mechanism based model focusing on salient local regions and their correlations is proposed in this paper. The proposed method: 1) utilizes multiple attention blocks to attend to distinct discriminative regions and extract corresponding local features automatically, 2) a correlation attention module is integrated in the model to extract global correlation feature over the salient regions, and finally 3) fuses the correlation feature and local features in an efficient way for the final facial expression classification. By this way, the useful but subtle local information can be utilized in more detail, and the correlation of different local regions is also extracted. Extensive experiment on the LSEMSW and CK+ datasets shows that the method proposed in this paper achieves superior results, which demonstrates its effectiveness. (c) 2020 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available