☆ 4.7 Article

Joint Deep Learning of Facial Expression Synthesis and Recognition

IEEE TRANSACTIONS ON MULTIMEDIA (2020)

Journal

IEEE TRANSACTIONS ON MULTIMEDIA

Volume 22, Issue 11, Pages 2792-2807

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TMM.2019.2962317

Keywords

Gallium nitride; Face recognition; Databases; Generative adversarial networks; Deep learning; Training data; Generators; Facial expression recognition; facial expression synthesis; convolutional neural networks (CNNs); generative adversarial net (GAN)

Funding

National Key R&D Program of China [2017YFB1302400]
National Natural Science Foundation of China [61571379, U1605252, 61872307]
Natural Science Foundation of Fujian Province of China [2017J01127, 2018J01576]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Recently, deep learning based facial expression recognition (FER) methods have attracted considerable attention and they usually require large-scale labelled training data. Nonetheless, the publicly available facial expression databases typically contain a small amount of labelled data. In this paper, to overcome the above issue, we propose a novel joint deep learning of facial expression synthesis and recognition method for effective FER. More specifically, the proposed method involves a two-stage learning procedure. Firstly, a facial expression synthesis generative adversarial network (FESGAN) is pre-trained to generate facial images with different facial expressions. To increase the diversity of the training images, FESGAN is elaborately designed to generate images with new identities from a prior distribution. Secondly, an expression recognition network is jointly learned with the pre-trained FESGAN in a unified framework. In particular, the classification loss computed from the recognition network is used to simultaneously optimize the performance of both the recognition network and the generator of FESGAN. Moreover, in order to alleviate the problem of data bias between the real images and the synthetic images, we propose an intra-class loss with a novel real data-guided back-propagation (RDBP) algorithm to reduce the intra-class variations of images from the same class, which can significantly improve the final performance. Extensive experimental results on public facial expression databases demonstrate the superiority of the proposed method compared with several state-of-the-art FER methods.

Joint Deep Learning of Facial Expression Synthesis and Recognition

Journal

IEEE TRANSACTIONS ON MULTIMEDIA

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Joint Deep Learning of Facial Expression Synthesis and Recognition

Journal

IEEE TRANSACTIONS ON MULTIMEDIA

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper