4.7 Article

A Convolution Bidirectional Long Short-Term Memory Neural Network for Driver Emotion Recognition

期刊

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TITS.2020.3007357

关键词

Emotion recognition; facial skin; heart rate; bidirectional long short-term memory (Bi-LSTM); CBLNN

资金

  1. National Key Research and Development Program of China [2018YFB1700500]
  2. National Natural Science Foundation of China [61973126, 61902147]
  3. Guangdong Natural Science Funds for Distinguished Young Scholar [2017A030306015]
  4. Pearl River S&T Nova Program of Guangzhou [201710010059]
  5. Guangdong Special Projects [2016TQ03X824]
  6. Fundamental Research Funds for the Central Universities [2019ZD27, 21619312]
  7. Innovation Team of the Modern Agriculture Industry Technology System in Guangdong Province [2019KJ139]
  8. Key-Area Research and Development Program of Guangdong Province [2020B010166006]
  9. Science and Technology Planning Project of Guangdong Province [2017B090914002]
  10. Scientific research project of Southwest Minzu University [2015NYB11]

向作者/读者索取更多资源

The paper proposes a new deep learning framework CBLNN for real-time recognition of driver emotions by extracting facial information and heart rate data. Tested and proven to be able to quickly and steadily identify happiness, anger, sadness, fear, and neutrality in real time.
Real-time recognition of driver emotions can greatly improve traffic safety. With the rapid development of communication technology, it becomes possible to process large amounts of video data and identify the driver's emotions in real time. To effectively recognize driver's emotions, this paper proposes a new deep learning framework called Convolution Bidirectional Long Short-term Memory Neural Network (CBLNN). This method predicts the driver's emotion based on the geometric features extracted from facial skin information and the heart rate extracted from changes in RGB components. The facial geometry features obtained by using Convolutional Neural Network (CNN) are intermediate variables for the heart rate analysis of Bidirectional Long Short Term Memory (Bi-LSTM). Subsequently, the output of Bi-LSTM is used as input to the CNN module to extract the hear rate features. CBLNN uses Multi-modal factorized bilinear pooling (MFB) to fuse the extracted information and classifies it into five common emotions: happiness, anger, sadness, fear and neutrality. Our emotion recognition method was tested, proving that it can be used to quickly and steadily recognize emotions in real time.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据