4.4 Article

Fusing HOG and convolutional neural network spatial-temporal features for video-based facial expression recognition

Journal

IET IMAGE PROCESSING
Volume 14, Issue 1, Pages 176-182

Publisher

WILEY
DOI: 10.1049/iet-ipr.2019.0293

Keywords

computer vision; face recognition; feature extraction; support vector machines; emotion recognition; convolutional neural nets; video signal processing; convolutional neural network spatial-temporal features; video-based facial expression recognition; VFER; fundamental feature; visual features; comprehensive feature; video frame; HOG features; facial expressions; CNN shallow features

Funding

  1. Zhejiang Provincial Public Welfare Project of China [LGF19F020009]

Ask authors/readers for more resources

Video-based facial expression recognition (VFER) is the fundamental feature of various computer vision applications. Visual features are the key factors for facial expression recognition. However, the gap between the visual features and the emotions is large. In order to bridge the gap, the proposed method utilises convolutional neural networks (CNNs) and histogram of oriented gradient (HOG) to obtain the more comprehensive feature for VFER. Firstly, it extracts shallow features from the video frame through a number of convolutional kernels in CNNs, which has the characteristics of displacement, scale and deformation invariance. Then, the HOG is employed to extract HOG features from CNN's shallow features, which are strongly correlated with facial expressions. Finally, the support vector machine (SVM) is employed to conduct the task of facial expression recognition. The extensive experiments on RML, CK+ and AFEW5.0 database show that this framework takes on the promising performance and outperforming the state of the arts.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available