期刊
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA
卷 148, 期 5, 页码 EL414-EL419出版社
ACOUSTICAL SOC AMER AMER INST PHYSICS
DOI: 10.1121/10.0002462
关键词
-
资金
- National Science Foundation (NSF)
- British Telecom India Research Center (BTIRC)
A listening test is proposed in which human participants detect talker changes in two natural, multi-talker speech stimuli sets-a familiar language (English) and an unfamiliar language (Chinese). Miss rate, false-alarm rate, and response times (RT) showed a significant dependence on language familiarity. Linear regression modeling of RTs using diverse acoustic features derived from the stimuli showed recruitment of a pool of acoustic features for the talker change detection task. Further, benchmarking the same task against the state-of-the-art machine diarization system showed that the machine system achieves human parity for the familiar language but not for the unfamiliar language. (C) 2020 Acoustical Society of America.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据