4.6 Article

Diagnose Parkinson's disease and cleft lip and palate using deep convolutional neural networks evolved by IP-based chimp optimization algorithm

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.bspc.2022.103688

Keywords

Pathological speech; Deep convolutional neural networks; Parkinson's disease; Cleft lip and palate; Chimp optimization algorithm

Funding

  1. Zhejiang Soft Science Research Program [2021C35016]
  2. Zhejiang Educational Science Planning Grant [2020SCG163]
  3. Zhejiang, China

This paper presents the use of deep convolutional neural networks (DCNNs) and the chimp optimization algorithm (ChOA) to recognize disordered speech. Several advancements using ChOA are proposed to optimize the DCNN structure. Experimental results show that the proposed model accurately diagnoses abnormal speech signals from patients with Parkinson's disease and cleft lip and palate.
Speech signals often include paralinguistic features such as pathologies that impair a speaker's ability to communicate. These cognitive symptoms have various causes depending on the disease. For example, morphological conditions such as cleft lip and palate produce hypernasality, while neurodegenerative conditions such as Parkinson's disease cause hypokinetic dysarthria. Automatic assessment of abnormal speech supports early diagnosis and evaluation of disease severity. Conventional methods rely on manually assessing single aspects such as shimmer, jitter, or formant frequencies, which may not fully reflect the disease's manifestations. In this paper, we use deep convolutional neural networks (DCNNs) to recognize disordered speech. Despite the many proven benefits of DCNNs, selecting the best structure is challenging. To overcome this issue, this research investigates using the chimp optimization algorithm (ChOA) to automatically select the optimal DCNN structure. To this end, three ChOA-based advancements are proposed. First, an internet protocol address-based (IPA-based) encoding method for DCNN layers employing chimp vectors is created. Second, an Enfeebled layer with specified chimp vector dimensions is presented for variable-length DCNNs. Finally, large datasets are partitioned into smaller ones and evaluated at random to recognize abnormal speech signals from patients with Parkinson's disease and cleft lip and palate. In addition to receiver operating characteristic (ROC) and precision-recall curves, five well-known metrics are used: sensitivity, specificity, accuracy, precision, and F1-score. The proposed model accurately distinguishes disordered from normal speech signals, with an accuracy of up to 96.37%, which exceeds the second-best approach, VLNSGA-II, by 1.62.
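
The five metrics named in the abstract can all be derived from the four counts of a binary confusion matrix. The following is a minimal sketch using their standard definitions, not the authors' evaluation code. The names `ConfusionCounts` and `binary_metrics` are hypothetical, "positive" is assumed to mean disordered speech, and the counts in the usage note are illustrative rather than taken from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Counts for a binary classifier (assumption: positive = disordered speech)."""
    tp: int  # disordered samples correctly labeled disordered
    fp: int  # normal samples wrongly labeled disordered
    tn: int  # normal samples correctly labeled normal
    fn: int  # disordered samples wrongly labeled normal


def _ratio(numerator: float, denominator: float) -> float:
    """Return numerator / denominator, or 0.0 when the denominator is zero."""
    return numerator / denominator if denominator else 0.0


def binary_metrics(c: ConfusionCounts) -> dict:
    """Compute sensitivity, specificity, accuracy, precision, and F1-score."""
    sensitivity = _ratio(c.tp, c.tp + c.fn)  # also called recall
    specificity = _ratio(c.tn, c.tn + c.fp)
    accuracy = _ratio(c.tp + c.tn, c.tp + c.fp + c.tn + c.fn)
    precision = _ratio(c.tp, c.tp + c.fp)
    # F1 is the harmonic mean of precision and sensitivity.
    f1 = _ratio(2 * precision * sensitivity, precision + sensitivity)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "precision": precision,
        "f1": f1,
    }
```

For example, `binary_metrics(ConfusionCounts(tp=90, fp=10, tn=85, fn=15))` yields an accuracy of 0.875 and a precision of 0.9. Reporting sensitivity and specificity alongside accuracy matters when the disordered and normal classes are imbalanced, since accuracy alone can hide poor performance on the smaller class.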
