Article

A tractable framework for estimating and combining spectral source models for audio source separation

SIGNAL PROCESSING (2012)

Journal

SIGNAL PROCESSING

Volume 92, Issue 8, Pages 1886-1901

Publisher

ELSEVIER

DOI: 10.1016/j.sigpro.2011.12.022

Keywords

Blind source separation; Multichannel audio; Gaussian mixture model; Expectation-maximization algorithm; Convolutive mixture

Category

Engineering, Electrical & Electronic

Funding

EU [FP7-ICT-225913-SMALL]
OSEO, the French State agency for innovation

Abstract

Underdetermined blind audio source separation (BSS) is often addressed in the time-frequency (TF) domain by modeling each TF point as an independent random variable with a sparse distribution. Methods based on structured spectral models, such as Spectral Gaussian Scaled Mixture Models (Spectral-GSMMs) or Spectral Non-negative Matrix Factorization models, perform better because they exploit the statistical diversity of audio source spectrograms and so go beyond the simple sparsity assumption. However, for discrete state-based models such as Spectral-GSMMs, learning the models from the mixture can be computationally very expensive. One of the main problems is that a classical Expectation-Maximization procedure often has a complexity that is exponential in the number of sources. In this paper, we propose a framework with linear complexity for learning spectral source models (including discrete state-based models) from noisy source estimates. Moreover, this framework allows different probabilistic models to be combined, which can be seen as a form of probabilistic fusion. We illustrate that methods based on this framework can significantly improve BSS performance compared to state-of-the-art approaches. © 2012 Elsevier B.V. All rights reserved.
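
As a rough illustration of the complexity argument in the abstract, the sketch below compares how many state hypotheses must be considered per time-frequency frame in two settings. It is not the paper's algorithm. It assumes that every source has the same number of discrete states, that a classical EM step over the mixture enumerates every joint combination of states, and that learning each source model separately from its own estimate only visits that source's states. The function names and the choice of 8 states per source are hypothetical.

```python
from itertools import product


def joint_state_count(num_sources: int, states_per_source: int) -> int:
    """Joint state combinations across all sources (assumed cost driver of
    classical EM on the mixture): grows exponentially with num_sources."""
    return states_per_source ** num_sources


def per_source_state_count(num_sources: int, states_per_source: int) -> int:
    """States visited when each source model is handled on its own
    (assumed cost driver of a per-source scheme): grows linearly."""
    return num_sources * states_per_source


if __name__ == "__main__":
    states = 8  # hypothetical number of discrete states per source
    for sources in (2, 3, 4, 5):
        # Cross-check the closed form against explicit enumeration.
        enumerated = sum(1 for _ in product(range(states), repeat=sources))
        assert enumerated == joint_state_count(sources, states)
        print(sources,
              joint_state_count(sources, states),
              per_source_state_count(sources, states))
```

Under these assumptions, going from 2 to 5 sources multiplies the joint count by 8 for every added source, while the per-source count grows by only 8 states per added source.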
