4.6 Article

Depression recognition base on acoustic speech model of Multi-task emotional stimulus

期刊

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.bspc.2023.104970

关键词

Depression recognition; Feature selection; Emotional stimulus; Bagging

向作者/读者索取更多资源

Depression imposes a significant burden on families and society due to its high prevalence, recurrence, and disability mortality. Researchers are increasingly focusing on using efficient and objective methods to recognize depression. Subtle changes in the speaker's physical and mental state are subconsciously reflected in their vocal apparatus. Speech signals are easily influenced by emotional stimuli, making them a substantial factor in depression recognition.
Depression places great burden on families and society owning to its high prevalence recurrence and disability mortality. Using efficient and objective methods to recognized depression has attracted more and more attention from researchers. Subtle changes in the speaker's physical and mental state will be subconsciously reflected in vocal apparatus. Individuals have different responses to different emotional stimuli. Speech signals are easily affected by emotional stimuli, and thus will have a great impact on depression recognition. This study has two aims, first was to collect speech data in different emotional stimulus (positive, neutral and negative), and explore effective feature set with strong interpretability. The second aim was to design efficient multi-task recognition model. A depression recognition method based on max-relevance and min-redundancy (mRMR) with multi-class labels (MCL-mRMR) and multi-task stimulus weighted Bagging (MTSW-Bagging) classifier was proposed. Firstly, MCL-mRMR selected features which had high correlation with emotional valence and depression, meanwhile features' dimensions decreased. Next, MTSW-Bagging classifier was designed to recognize depression, whose base classifier was composed of weighted multi-task emotional stimulus classifiers. Experimental results showed that the features selected by MCL-mRMR had higher performance with the accuracy and F1 score were increased by 5.59% and 4.2% respectively compared with the original full features. Meanwhile, our proposed method was superior to baseline method with an improvement of 13.2% and 12.8% on accuracy and F1 score respectively. Compared with state-of-the-art related methods, our method also had its superiority of strong interpretability of features and being independent of training data scale.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据