☆ 3.8 Proceedings Paper

Bipolar Disorder Recognition via Multi-scale Discriminative Audio Temporal Representation

PROCEEDINGS OF THE 2018 AUDIO/VISUAL EMOTION CHALLENGE AND WORKSHOP (AVEC'18) (2018)

期刊

PROCEEDINGS OF THE 2018 AUDIO/VISUAL EMOTION CHALLENGE AND WORKSHOP (AVEC'18)

卷 -, 期 -, 页码 23-30

出版社

ASSOC COMPUTING MACHINERY

DOI: 10.1145/3266302.3268997

关键词

Bipolar Disorder Recognition; Multi-scale Temporal Modeling; IncepLSTM; Severity-sensitive Loss

类别

Computer Science, Artificial Intelligence Computer Science, Theory & Methods Engineering, Electrical & Electronic

资金

National Natural Science Foundation of China [61673033]
Research Program of State Key Laboratory of Software Development Environment [SKLSDE-2017ZX-07]
Microsoft Research Asia Collaborative Program [FY17-RES-THEME033]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Bipolar disorder (BD) is a prevalent mental illness which has a negative impact on work and social function. However, bipolar symptoms are episodic, especially with irregular variations among different episodes, making BD very difficult to be diagnosed accurately. To solve this problem, this paper presents a novel audio-based approach, called IncepLSTM, which effectively integrates Inception module and Long Short-Term Memory (LSTM) on the feature sequence to capture multi-scale temporal information for BD recognition. Moreover, in order to obtain a discriminative representation of BD severity, we propose a novel severity-sensitive loss based on the triplet loss to model the inter-severity relationship. Considering the small scale of existing BD corpus, to avoid overfitting, we also make use of L1 regulation to improve the sparsity of IncepLSTM. The evaluations are conducted on the Audio/Visual Emotion Challenge (AVEC) 2018 Dataset and the experimental results clearly demonstrate the effectiveness of our method.

Bipolar Disorder Recognition via Multi-scale Discriminative Audio Temporal Representation

期刊

PROCEEDINGS OF THE 2018 AUDIO/VISUAL EMOTION CHALLENGE AND WORKSHOP (AVEC'18)

出版社

ASSOC COMPUTING MACHINERY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Bipolar Disorder Recognition via Multi-scale Discriminative Audio Temporal Representation

期刊

PROCEEDINGS OF THE 2018 AUDIO/VISUAL EMOTION CHALLENGE AND WORKSHOP (AVEC'18)

出版社

ASSOC COMPUTING MACHINERY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文