期刊
PATTERN ANALYSIS AND APPLICATIONS
卷 26, 期 2, 页码 543-553出版社
SPRINGER
DOI: 10.1007/s10044-022-01124-w
关键词
Facial expression recognition; Local relation; Attention mechanism; Deep convolutional network
This article introduces a novel method for facial expression recognition, which focuses on discriminative attention regions and pretrains on ImageNet to alleviate over-fitting. Experimental results demonstrate the superior performance of this method on multiple benchmark datasets.
Both the multiple sources of the available in-the-wild datasets and noisy information of images lead to huge challenges for discriminating subtle distinctions between combinations of regional expressions in facial expression recognition (FER). Although deep learning-based approaches have made substantial progresses in FER in recent years, small-scale datasets result in over-fitting during training. To this end, we propose a novel LSGB method which focuses on discriminative attention regions accurately and pretrain the model on ImageNet with the aim of alleviating the problem of over-fitting. Specifically, a more efficient manner combined with a key map, multiple partial maps and a position map is presented in local relation (LR) module to construct higher-level entities through compositional relationship of local pixel pairs. A compact global weighted representation is aggregated by region features, of which the weight is obtained by putting original and regional images to the sequential layer of self-attention module. Finally, extensive experiments are conducted to verify the effectiveness of our proposal. The experimental results on three popular benchmarks demonstrate the superiority of our network with 88.8% on FERplus, 58.68% on AffectNet and 94.9% on JAFFE.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据