4.6 Article Proceedings Paper

Direct classification of all American English phonemes using signals from functional speech motor cortex

期刊

JOURNAL OF NEURAL ENGINEERING
卷 11, 期 3, 页码 -

出版社

IOP PUBLISHING LTD
DOI: 10.1088/1741-2560/11/3/035015

关键词

electrocorticography; speech production; phonemes; linear discriminant analysis; brain-computer interface

资金

  1. Doris Duke Charitable Foundation [2011039]
  2. National Science Foundation [0549489, 0718558, 1064912]
  3. National Institutes of Health [NIBIB/NINDS EB00856]
  4. Mayo Clinic Foundation CR20 Grant
  5. Direct For Education and Human Resources
  6. Division Of Graduate Education [0549489] Funding Source: National Science Foundation
  7. Division Of Mathematical Sciences
  8. Direct For Mathematical & Physical Scien [0718558] Funding Source: National Science Foundation
  9. Div Of Information & Intelligent Systems
  10. Direct For Computer & Info Scie & Enginr [1064912] Funding Source: National Science Foundation

向作者/读者索取更多资源

Objective. Although brain-computer interfaces (BCIs) can be used in several different ways to restore communication, communicative BCI has not approached the rate or efficiency of natural human speech. Electrocorticography (ECoG) has precise spatiotemporal resolution that enables recording of brain activity distributed over a wide area of cortex, such as during speech production. In this study, we sought to decode elements of speech production using ECoG. Approach. We investigated words that contain the entire set of phonemes in the general American accent using ECoG with four subjects. Using a linear classifier, we evaluated the degree to which individual phonemes within each word could be correctly identified from cortical signal. Main results. We classified phonemes with up to 36% accuracy when classifying all phonemes and up to 63% accuracy for a single phoneme. Further, misclassified phonemes follow articulation organization described in phonology literature, aiding classification of whole words. Precise temporal alignment to phoneme onset was crucial for classification success. Significance. We identified specific spatiotemporal features that aid classification, which could guide future applications. Word identification was equivalent to information transfer rates as high as 3.0 bits s(-1) (33.6 words min(-1)), supporting pursuit of speech articulation for BCI control.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据