4.6 Article

A novel enhanced convolution neural network with extreme learning machine: facial emotional recognition in psychology practices

期刊

MULTIMEDIA TOOLS AND APPLICATIONS
卷 82, 期 5, 页码 6479-6503

出版社

SPRINGER
DOI: 10.1007/s11042-022-13567-8

关键词

Convolution neural network; Stochastic gradient descent; Log-likelihood estimator; Optical flow estimation; Extreme learning machine; Cross-entropy loss

向作者/读者索取更多资源

This research aims to improve facial emotion recognition accuracy and reduce processing time using a modified Convolution Neural Network Enhanced with Extreme Learning Machine (CNNEELM). By using optical flow estimation technique for motion detection and image preprocessing, the accuracy in facial emotion recognition is improved, while the training with specific models and datasets speeds up the image processing.
Facial emotional recognition is one of the essential tools used by recognition psychology to diagnose patients. Face and facial emotional recognition are areas where machine learning is excelling. Facial Emotion Recognition in an unconstrained environment is an open challenge for digital image processing due to different environments, such as lighting conditions, pose variation, yaw motion, and occlusions. Deep learning approaches have shown significant improvements in image recognition. However, accuracy and time still need improvements. This research aims to improve facial emotion recognition accuracy during the training session and reduce processing time using a modified Convolution Neural Network Enhanced with Extreme Learning Machine (CNNEELM). The proposed system consists of an optical flow estimation technique that detects the motion of change in facial expression and extracts peak images from video frames for image pre-processing. The system entails (CNNEELM) improving the accuracy in image registration during the training session. Furthermore, the system recognizes six facial emotions - happy, sad, disgust, fear, surprise, and neutral with the proposed CNNEELM model. The study shows that the overall facial emotion recognition accuracy is improved by 2% than the state of art solutions with a modified Stochastic Gradient Descent (SGD) technique. With the Extreme Learning Machine (ELM) classifier, the processing time is brought down to 65 ms from 113 ms, which can smoothly classify each frame from a video clip at 20fps. With the pre-trained InceptionV3 model, the proposed CNNEELM model is trained with JAFFE, CK+, and FER2013 expression datasets. The simulation results show significant improvements in accuracy and processing time, making the model suitable for the video analysis process. Besides, the study solves the issue of the large processing time required to process the facial images.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据