4.5 Article Proceedings Paper

Speaker recognition with temporal cues in acoustic and electric hearing

期刊

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA
卷 118, 期 2, 页码 1055-1061

出版社

ACOUSTICAL SOC AMER AMER INST PHYSICS
DOI: 10.1121/1.1944507

关键词

-

资金

  1. NIDCD NIH HHS [2R01DC02267] Funding Source: Medline

向作者/读者索取更多资源

Natural spoken language processing includes not, only speech recognition but also identification of the speaker's gender, age, emotional, and social status. Our purpose in this study is to evaluate whether temporal cues are sufficient to support both speech and speaker recognition. Ten cochlear-implant and six normal-hearing subjects were presented with vowel tokens spoken by three men, three women, two boys, and two girls. In one condition, the subject was asked to recognize the vowel. In the other condition, the subject was asked to identify the speaker. Extensive training was provided for the speaker recognition task. Normal-hearing subjects achieved nearly perfect performance in both tasks. Cochlear-implant subjects achieved good performance in vowel recognition but poor performance in speaker recognition. The level of the cochlear implant performance was functionally equivalent to normal performance with eight spectral bands for vowel recognition but only to one band for speaker recognition. These results show a disassociation between speech and speaker recognition with primarily temporal cues, highlighting the limitation of current speech processing strategies in cochlear implants. Several methods, including explicit encoding of fundamental frequency and frequency modulation, are proposed to improve speaker recognition for current cochlear implant users. (C) 2005 Acoustical Society of America.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据