4.6 Article

Depression recognition base on acoustic speech model of Multi-task emotional stimulus

Journal

BIOMEDICAL SIGNAL PROCESSING AND CONTROL
Volume 85, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.bspc.2023.104970

Keywords

Depression recognition; Feature selection; Emotional stimulus; Bagging

Ask authors/readers for more resources

Depression imposes a significant burden on families and society due to its high prevalence, recurrence, and disability mortality. Researchers are increasingly focusing on using efficient and objective methods to recognize depression. Subtle changes in the speaker's physical and mental state are subconsciously reflected in their vocal apparatus. Speech signals are easily influenced by emotional stimuli, making them a substantial factor in depression recognition.
Depression places great burden on families and society owning to its high prevalence recurrence and disability mortality. Using efficient and objective methods to recognized depression has attracted more and more attention from researchers. Subtle changes in the speaker's physical and mental state will be subconsciously reflected in vocal apparatus. Individuals have different responses to different emotional stimuli. Speech signals are easily affected by emotional stimuli, and thus will have a great impact on depression recognition. This study has two aims, first was to collect speech data in different emotional stimulus (positive, neutral and negative), and explore effective feature set with strong interpretability. The second aim was to design efficient multi-task recognition model. A depression recognition method based on max-relevance and min-redundancy (mRMR) with multi-class labels (MCL-mRMR) and multi-task stimulus weighted Bagging (MTSW-Bagging) classifier was proposed. Firstly, MCL-mRMR selected features which had high correlation with emotional valence and depression, meanwhile features' dimensions decreased. Next, MTSW-Bagging classifier was designed to recognize depression, whose base classifier was composed of weighted multi-task emotional stimulus classifiers. Experimental results showed that the features selected by MCL-mRMR had higher performance with the accuracy and F1 score were increased by 5.59% and 4.2% respectively compared with the original full features. Meanwhile, our proposed method was superior to baseline method with an improvement of 13.2% and 12.8% on accuracy and F1 score respectively. Compared with state-of-the-art related methods, our method also had its superiority of strong interpretability of features and being independent of training data scale.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available