4.7 Article

Speech Categorization Reveals the Role of Early-Stage Temporal-Coherence Processing in Auditory Scene Analysis

期刊

JOURNAL OF NEUROSCIENCE
卷 42, 期 2, 页码 240-254

出版社

SOC NEUROSCIENCE
DOI: 10.1523/JNEUROSCI.1610-21.2021

关键词

cochlear nucleus; comodulation mastring release; computational modeling; consonant confusions; cross-channel processing; wideband inhibition

资金

  1. National Institutes of Health [F31DC017381, R01DC009838]
  2. Office of Naval Research Grant ONR [N00014-20-12709]

向作者/读者索取更多资源

Temporal coherence of sound fluctuations plays an important role in auditory grouping and scene segregation, not only in the cortex but also in early auditory areas. By studying consonant categorization and using computational models, it was found that speech understanding in noise is influenced by temporal-coherence processing, and physiological computations in the early auditory pathway may contribute to this process.
Temporal coherence of sound fluctuations across spectral channels is thought to aid auditory grouping and scene segregation. Although prior studies on the neural bases of temporal-coherence processing focused mostly on cortical contributions, neuro-physiological evidence suggests that temporal-coherence-based scene analysis may start as early as the cochlear nucleus (i.e., the first auditory region supporting cross-channel processing over a wide frequency range). Accordingly, we hypothesized that aspects of temporal-coherence processing that could be realized in early auditory areas may shape speech understanding in noise. We then explored whether physiologically plausible computational models could account for results from a behav-ioral experiment that measured consonant categorization in different masking conditions. We tested whether within-channel masking of target-speech modulations predicted consonant confusions across the different conditions and whether predictions were improved by adding across-channel temporal-coherence processing mirroring the computations known to exist in the cochlear nucleus. Consonant confusions provide a rich characterization of error patterns in speech categorization, and are thus crucial for rigorously testing models of speech perception; however, to the best of our knowledge, they have not been used in prior studies of scene analysis. We find that within-channel modulation masking can reasonably account for category confusions, but that it fails when temporal fine structure cues are unavailable. However, the addition of across-channel tem-poral-coherence processing significantly improves confusion predictions across all tested conditions. Our results suggest that temporal-coherence processing strongly shapes speech understanding in noise and that physiological computations that exist early along the auditory pathway may contribute to this process.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据