Journal
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA
Volume 118, Issue 2, Pages 1055-1061Publisher
ACOUSTICAL SOC AMER AMER INST PHYSICS
DOI: 10.1121/1.1944507
Keywords
-
Categories
Funding
- NIDCD NIH HHS [2R01DC02267] Funding Source: Medline
Ask authors/readers for more resources
Natural spoken language processing includes not, only speech recognition but also identification of the speaker's gender, age, emotional, and social status. Our purpose in this study is to evaluate whether temporal cues are sufficient to support both speech and speaker recognition. Ten cochlear-implant and six normal-hearing subjects were presented with vowel tokens spoken by three men, three women, two boys, and two girls. In one condition, the subject was asked to recognize the vowel. In the other condition, the subject was asked to identify the speaker. Extensive training was provided for the speaker recognition task. Normal-hearing subjects achieved nearly perfect performance in both tasks. Cochlear-implant subjects achieved good performance in vowel recognition but poor performance in speaker recognition. The level of the cochlear implant performance was functionally equivalent to normal performance with eight spectral bands for vowel recognition but only to one band for speaker recognition. These results show a disassociation between speech and speaker recognition with primarily temporal cues, highlighting the limitation of current speech processing strategies in cochlear implants. Several methods, including explicit encoding of fundamental frequency and frequency modulation, are proposed to improve speaker recognition for current cochlear implant users. (C) 2005 Acoustical Society of America.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available