4.7 Article

Joint Deep Learning of Facial Expression Synthesis and Recognition

Journal

IEEE TRANSACTIONS ON MULTIMEDIA
Volume 22, Issue 11, Pages 2792-2807

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TMM.2019.2962317

Keywords

Gallium nitride; Face recognition; Databases; Generative adversarial networks; Deep learning; Training data; Generators; Facial expression recognition; facial expression synthesis; convolutional neural networks (CNNs); generative adversarial net (GAN)

Funding

  1. National Key R&D Program of China [2017YFB1302400]
  2. National Natural Science Foundation of China [61571379, U1605252, 61872307]
  3. Natural Science Foundation of Fujian Province of China [2017J01127, 2018J01576]

Ask authors/readers for more resources

Recently, deep learning based facial expression recognition (FER) methods have attracted considerable attention and they usually require large-scale labelled training data. Nonetheless, the publicly available facial expression databases typically contain a small amount of labelled data. In this paper, to overcome the above issue, we propose a novel joint deep learning of facial expression synthesis and recognition method for effective FER. More specifically, the proposed method involves a two-stage learning procedure. Firstly, a facial expression synthesis generative adversarial network (FESGAN) is pre-trained to generate facial images with different facial expressions. To increase the diversity of the training images, FESGAN is elaborately designed to generate images with new identities from a prior distribution. Secondly, an expression recognition network is jointly learned with the pre-trained FESGAN in a unified framework. In particular, the classification loss computed from the recognition network is used to simultaneously optimize the performance of both the recognition network and the generator of FESGAN. Moreover, in order to alleviate the problem of data bias between the real images and the synthetic images, we propose an intra-class loss with a novel real data-guided back-propagation (RDBP) algorithm to reduce the intra-class variations of images from the same class, which can significantly improve the final performance. Extensive experimental results on public facial expression databases demonstrate the superiority of the proposed method compared with several state-of-the-art FER methods.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available