期刊
JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH
卷 50, 期 6, 页码 1445-1465出版社
AMER SPEECH-LANGUAGE-HEARING ASSOC
DOI: 10.1044/1092-4388(2007/100)
关键词
intelligibility; synthetic speech; listener training; perceptual learning
资金
- NIDCD NIH HHS [R03 DC006811] Funding Source: Medline
Purpose: Investigate training-related changes in acoustic-phonetic representation of consonants produced by a text-to-speech (TTS) computer speech synthesizer. Method: Forty-eight adult listeners were trained to better recognize words produced by a TTS system. Nine additional untrained participants served as controls. Before and after training, participants were tested on consonant recognition and made pairwise judgments of consonant dissimilarity for subsequent multidimensional scaling (MDS) analysis. Results: Word recognition training significantly improved performance on consonant identification, although listeners never received specific training on phoneme recognition. Data from 31 participants showing clear evidence of learning (improvement >= 10 percentage points) were further investigated using MDS and analysis of confusion matrices. Results show that training altered listeners' treatment of particular acoustic cues, resulting in both increased within-class similarity and between-class distinctiveness. Some changes were consistent with current models of perceptual learning, but others were not. Conclusion: Training caused listeners to interpret the acoustic properties of synthetic speech more like those of natural speech, in a manner consistent with a flexible-feature model of perceptual learning. Further research is necessary to refine these conclusions and to investigate their applicability to other training-related changes in intelligibility (e. g., associated with learning to better understand dysarthric speech or foreign accents).
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据