4.7 Article

Joint Deep Learning of Facial Expression Synthesis and Recognition

期刊

IEEE TRANSACTIONS ON MULTIMEDIA
卷 22, 期 11, 页码 2792-2807

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TMM.2019.2962317

关键词

Gallium nitride; Face recognition; Databases; Generative adversarial networks; Deep learning; Training data; Generators; Facial expression recognition; facial expression synthesis; convolutional neural networks (CNNs); generative adversarial net (GAN)

资金

  1. National Key R&D Program of China [2017YFB1302400]
  2. National Natural Science Foundation of China [61571379, U1605252, 61872307]
  3. Natural Science Foundation of Fujian Province of China [2017J01127, 2018J01576]

向作者/读者索取更多资源

Recently, deep learning based facial expression recognition (FER) methods have attracted considerable attention and they usually require large-scale labelled training data. Nonetheless, the publicly available facial expression databases typically contain a small amount of labelled data. In this paper, to overcome the above issue, we propose a novel joint deep learning of facial expression synthesis and recognition method for effective FER. More specifically, the proposed method involves a two-stage learning procedure. Firstly, a facial expression synthesis generative adversarial network (FESGAN) is pre-trained to generate facial images with different facial expressions. To increase the diversity of the training images, FESGAN is elaborately designed to generate images with new identities from a prior distribution. Secondly, an expression recognition network is jointly learned with the pre-trained FESGAN in a unified framework. In particular, the classification loss computed from the recognition network is used to simultaneously optimize the performance of both the recognition network and the generator of FESGAN. Moreover, in order to alleviate the problem of data bias between the real images and the synthetic images, we propose an intra-class loss with a novel real data-guided back-propagation (RDBP) algorithm to reduce the intra-class variations of images from the same class, which can significantly improve the final performance. Extensive experimental results on public facial expression databases demonstrate the superiority of the proposed method compared with several state-of-the-art FER methods.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据