4.7 Article

Facial Expression Recognition in-the-Wild Using Blended Feature Attention Network

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TIM.2023.3314815

关键词

Face recognition; Iron; Feature extraction; Lighting; Training data; Task analysis; Data mining; Attention mechanism; facial expression recognition (FER); fuzzy integral; illumination; intensity variations; occlusion and pose robust; statistical significance

向作者/读者索取更多资源

Facial expression analysis is important in affective computing, marketing, and clinical evaluation. Existing research on facial expression recognition (FER) faces challenges such as occlusion, pose changes, illumination variations, and limited training data. This study proposes a novel method to address these challenges by using modified homomorphic filtering for illumination normalization and dividing the face image into five local regions to highlight expression-specific characteristics. A unique blended feature attention network is then designed to extract relevant and discriminative features and compute probability scores for FER.
Facial expression (FE) analysis plays a crucial role in various fields, such as affective computing, marketing, and clinical evaluation. Despite numerous advances, research on FE recognition (FER) has recently been proceeding from confined laboratory circumstances to in-the-wild environments. FER is still an arduous and demanding problem due to occlusion and pose changes, intraclass and intensity variations caused by illumination, and insufficient training data. Most state-of-the-art (SOTA) approaches use the entire face for FER. However, past studies on psychology and physiology reveal that the mouth and eyes reflect the variations of various FEs, which are closely related to the manifestation of emotion. A novel method is proposed in this study to address some of the issues mentioned above. First, modified homomorphic filtering (MHF) is employed to normalize the illumination, then the normalized face image is cropped into five local regions to emphasize expression-specific characteristics. Finally, a unique blended feature attention network (BFAN) is designed for FER. BFAN consists of both residual dilated multiscale (RDMS) feature extraction modules and spatial and channel-wise attention (CWA) modules. These modules help to extract the most relevant and discriminative features from the high-level (HL) and low-level (LL) features. Then, both feature maps are integrated and passed on to the dense layers followed by a softmax layer to compute probability scores. Finally, the Choquet fuzzy integral is applied to the computed probability scores to get the final outcome. The superiority of the proposed method is exemplified by comparing it with 18 existing approaches on seven benchmark datasets.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据