☆ 4.7 Article

Acoustic Space Learning for Sound-Source Separation and Localization on Binaural Manifolds

INTERNATIONAL JOURNAL OF NEURAL SYSTEMS (2015)

Journal

INTERNATIONAL JOURNAL OF NEURAL SYSTEMS

Volume 25, Issue 1, Pages -

Publisher

WORLD SCIENTIFIC PUBL CO PTE LTD

DOI: 10.1142/S0129065714400036

Keywords

Binaural hearing; sound localization; sound-source separation; manifold learning; mixture of regressors; EM inference

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

In this paper, we address the problems of modeling the acoustic space generated by a full-spectrum sound source and using the learned model for the localization and separation of multiple sources that simultaneously emit sparse-spectrum sounds. We lay theoretical and methodological grounds in order to introduce the binaural manifold paradigm. We perform an in-depth study of the latent low-dimensional structure of the high-dimensional interaural spectral data, based on a corpus recorded with a human-like audiomotor robot head. A nonlinear dimensionality reduction technique is used to show that these data lie on a two-dimensional (2D) smooth manifold parameterized by the motor states of the listener, or equivalently, the sound-source directions. We propose a probabilistic piecewise affine mapping model (PPAM) specifically designed to deal with high-dimensional data exhibiting an intrinsic piecewise linear structure. We derive a closed-form expectation-maximization (EM) procedure for estimating the model parameters, followed by Bayes inversion for obtaining the full posterior density function of a sound-source direction. We extend this solution to deal with missing data and redundancy in real-world spectrograms, and hence for 2D localization of natural sound sources such as speech. We further generalize the model to the challenging case of multiple sound sources and we propose a variational EM framework. The associated algorithm, referred to as variational EM for source separation and localization (VESSL) yields a Bayesian estimation of the 2D locations and time-frequency masks of all the sources. Comparisons of the proposed approach with several existing methods reveal that the combination of acoustic-space learning with Bayesian inference enables our method to outperform state-of-the-art methods.

Acoustic Space Learning for Sound-Source Separation and Localization on Binaural Manifolds

Journal

INTERNATIONAL JOURNAL OF NEURAL SYSTEMS

Publisher

WORLD SCIENTIFIC PUBL CO PTE LTD

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Acoustic Space Learning for Sound-Source Separation and Localization on Binaural Manifolds

Journal

INTERNATIONAL JOURNAL OF NEURAL SYSTEMS

Publisher

WORLD SCIENTIFIC PUBL CO PTE LTD

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper