☆ 4.6 Article

From Microphone to Phoneme: An End-to-End Computational Neural Model for Predicting Speech Perception With Cochlear Implants

IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING (2022)

期刊

IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING

卷 69, 期 11, 页码 3300-3312

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TBME.2022.3167113

关键词

Biological system modeling; Computational modeling; Predictive models; Speech recognition; Finite element analysis; Ear; Speech processing; Neural prostheses; cochlear implants; computational models; automatic speech recognition; signal processing; information transmission; neural networks

类别

Engineering, Biomedical

资金

Cambridge Hearing Trust
Evelyn Trust
William Demont Foundation
HB Allen Trust
Rosetrees Trust Enterprise Fellowship [EF2020\100099]
RNID Flexigrant [F112]
Medical Research Council (MRC), U.K [MR/T03095X/1]
MRC Senior Fellowship in Hearing [MR/S002537/1]
National Institute Health Research Programme Grant for Applied Research [NIHR 201608]
Wellcome Trust [204845/Z/16/Z]
Wellcome Trust [204845/Z/16/Z] Funding Source: Wellcome Trust

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The study presented an end-to-end computational model for predicting speech perception with cochlear implants, which successfully replicated phoneme-level speech perception patterns and identified the bottleneck of information flow at the electrode-neural interface.

Goal: Advances in computational models of biological systems and artificial neural networks enable rapid virtual prototyping of neuroprostheses, accelerating innovation in the field. Here, we present an end-to-end computational model for predicting speech perception with cochlear implants (CI), the most widely-used neuroprosthesis. Methods: The model integrates CI signal processing, a finite element model of the electrically-stimulated cochlea, and an auditory nerve model to predict neural responses to speech stimuli. An automatic speech recognition neural network is then used to extract phoneme-level speech perception from these neural response patterns. Results: Compared to human CI listener data, the model predicts similar patterns of speech perception and misperception, captures between-phoneme differences in perceptibility, and replicates effects of stimulation parameters and noise on speech recognition. Information transmission analysis at different stages along the CI processing chain indicates that the bottleneck of information flow occurs at the electrode-neural interface, corroborating studies in CI listeners. Conclusion: An end-to-end model of CI speech perception replicated phoneme-level CI speech perception patterns, and was used to quantify information degradation through the CI processing chain. Significance: This type of model shows great promise for developing and optimizing new and existing neuroprostheses.

From Microphone to Phoneme: An End-to-End Computational Neural Model for Predicting Speech Perception With Cochlear Implants

期刊

IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

From Microphone to Phoneme: An End-to-End Computational Neural Model for Predicting Speech Perception With Cochlear Implants

期刊

IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文