☆ 4.5 Article

Robust Speech Perception: Recognize the Familiar, Generalize to the Similar, and Adapt to the Novel

PSYCHOLOGICAL REVIEW (2015)

Journal

PSYCHOLOGICAL REVIEW

Volume 122, Issue 2, Pages 148-203

Publisher

AMER PSYCHOLOGICAL ASSOC

DOI: 10.1037/a0038695

Keywords

speech perception; generalization; adaptation; statistical learning; lack of invariance

Funding

National Science Foundation (NSF)
National Institute of Child Health and Human Development (NICHD) [R01 HD075797]
Alfred P. Sloan Fellowship
Div Of Information & Intelligent Systems
Direct For Computer & Info Scie & Enginr [1150028] Funding Source: National Science Foundation

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Successful speech perception requires that listeners map the acoustic signal to linguistic categories. These mappings are not only probabilistic, but change depending on the situation. For example, one talker's /p/ might be physically indistinguishable from another talker's /b/(cf. lack of invariance). We characterize the computational problem posed by such a subjectively nonstationary world and propose that the speech perception system overcomes this challenge by (a) recognizing previously encountered situations, (b) generalizing to other situations based on previous similar experience, and (c) adapting to novel situations. We formalize this proposal in the ideal adapter framework: (a) to (c) can be understood as inference under uncertainty about the appropriate generative model for the current talker, thereby facilitating robust speech perception despite the lack of invariance. We focus on 2 critical aspects of the ideal adapter. First, in situations that clearly deviate from previous experience, listeners need to adapt. We develop a distributional (belief-updating) learning model of incremental adaptation. The model provides a good fit against known and novel phonetic adaptation data, including perceptual recalibration and selective adaptation. Second, robust speech recognition requires that listeners learn to represent the structured component of cross-situation variability in the speech signal. We discuss how these 2 aspects of the ideal adapter provide a unifying explanation for adaptation, talker-specificity, and generalization across talkers and groups of talkers (e.g., accents and dialects). The ideal adapter provides a guiding framework for future investigations into speech perception and adaptation, and more broadly language comprehension.

Robust Speech Perception: Recognize the Familiar, Generalize to the Similar, and Adapt to the Novel

Journal

PSYCHOLOGICAL REVIEW

Publisher

AMER PSYCHOLOGICAL ASSOC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Robust Speech Perception: Recognize the Familiar, Generalize to the Similar, and Adapt to the Novel

Journal

PSYCHOLOGICAL REVIEW

Publisher

AMER PSYCHOLOGICAL ASSOC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper