
A multi-resolution envelope-power based model for speech intelligibility

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2013)

Journal

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA

Volume 134, Issue 1, Pages 436-446

Publisher

ACOUSTICAL SOC AMER AMER INST PHYSICS

DOI: 10.1121/1.4807563

Funding

Danish Research Foundation
Oticon
Widex
GN ReSound
Deutsche Forschungsgemeinschaft (DFG; Individualisierte Hörakustik, TPE) [FOR 1732]

Abstract

The speech-based envelope power spectrum model (sEPSM) presented by Jørgensen and Dau [(2011). J. Acoust. Soc. Am. 130, 1475-1487] estimates the envelope power signal-to-noise ratio (SNRenv) after modulation-frequency selective processing. Changes in this metric were shown to account well for changes of speech intelligibility for normal-hearing listeners in conditions with additive stationary noise, reverberation, and nonlinear processing with spectral subtraction. In the latter condition, the standardized speech transmission index [(2003). IEC 60268-16] fails. However, the sEPSM is limited to conditions with stationary interferers, due to the long-term integration of the envelope power, and cannot account for increased intelligibility typically obtained with fluctuating maskers. Here, a multi-resolution version of the sEPSM is presented where the SNRenv is estimated in temporal segments with a modulation-filter dependent duration. The multi-resolution sEPSM is demonstrated to account for intelligibility obtained in conditions with stationary and fluctuating interferers, and noisy speech distorted by reverberation or spectral subtraction. The results support the hypothesis that the SNRenv is a powerful objective metric for speech intelligibility prediction. © 2013 Acoustical Society of America.
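The segment-wise SNRenv estimate described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes the inputs are already the outputs of one modulation filter for the noisy speech and for the noise alone, that envelope power is the within-segment variance normalised by the squared long-term mean, and that the caller picks the segment duration (for example, the inverse of the modulation filter's centre frequency). The function names and the `floor` value are hypothetical.

```python
import numpy as np

def segment_envelope_power(env, fs, seg_dur):
    """Envelope power per segment: within-segment variance divided by
    the squared long-term mean of the envelope (assumed normalisation)."""
    seg_len = max(1, int(round(seg_dur * fs)))
    n_seg = len(env) // seg_len
    # Drop the incomplete tail so the signal splits into equal segments.
    segs = np.asarray(env[: n_seg * seg_len], dtype=float).reshape(n_seg, seg_len)
    dc_power = np.mean(env) ** 2
    return np.var(segs, axis=1) / dc_power

def snr_env_per_segment(env_mix, env_noise, fs, seg_dur, floor=1e-4):
    """SNRenv per segment: (P_mix - P_noise) / P_noise.
    `floor` is an assumed lower limit that avoids division by zero
    and negative speech-power estimates."""
    p_mix = segment_envelope_power(env_mix, fs, seg_dur)
    p_noise = np.maximum(segment_envelope_power(env_noise, fs, seg_dur), floor)
    return np.maximum(p_mix - p_noise, floor) / p_noise

# Toy example: 4 Hz envelope fluctuation, segments of 1/4 s.
fs = 1000
t = np.arange(2 * fs) / fs
env_noise = 1.0 + 0.1 * np.sin(2 * np.pi * 4 * t)
env_mix = 1.0 + 0.5 * np.sin(2 * np.pi * 4 * t)
snr = snr_env_per_segment(env_mix, env_noise, fs, seg_dur=0.25)
```

Because each segment is evaluated separately, segments in which a fluctuating masker has little envelope power yield a high SNRenv, which a single long-term estimate would average away. How the per-segment values are combined across segments, modulation filters, and audio channels is left out of this sketch.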
