Article

Perceptually enhanced blind single-channel music source separation by Non-negative Matrix Factorization

DIGITAL SIGNAL PROCESSING (2013)

Journal

DIGITAL SIGNAL PROCESSING

Volume 23, Issue 2, Pages 646-658

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE

DOI: 10.1016/j.dsp.2012.10.001

Keywords

Blind audio source separation; Non-negative Matrix Factorization; Clustering; Perceptual quality

Category

Engineering, Electrical & Electronic

Abstract

We propose a new approach that improves the perceptual quality of the separated sources in blind single-channel musical source separation. It exploits the advantages of subspace learning based on Non-negative Matrix Factorization (NMF), in which the bases represent the notes. The cost function is formulated as a weighted beta-divergence by adopting the PEAQ auditory model defined in ITU-R BS.1387 into the source separation. The proposed perceptually weighted factorization scheme is integrated into Non-negative Matrix Factor 2-D Deconvolution (NMF2D) and Clustered Non-negative Matrix Factorization (CNMF) to overcome the source clustering problem encountered in under-determined source separation. It is shown that the introduced perceptually weighted NMF schemes, named PW-NMF2D and PW-CNMF, efficiently learn the bases, which enables a simple resynthesis of the musical sources based on the temporal model stored in the encoding matrix. Source separation performance is reported on musical mixtures, where an improvement of 1-2 dB is achieved in terms of SDR, SIR and SAR compared to the state-of-the-art methods. Performance has also been evaluated by perceptual measures, resulting in an improvement of 2-5 in OPS, TPS, IPS and APS values. © 2012 Elsevier Inc. All rights reserved.
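To make the idea of a weighted beta-divergence cost more concrete, the following is a minimal sketch of plain NMF with element-wise weights and the standard multiplicative updates. It is not the authors' PW-NMF2D or PW-CNMF: the function names, the parameter names, and above all the weight matrix `Q` are assumptions made for illustration. In the paper the weights come from the PEAQ auditory model; here `Q` is an arbitrary per-frequency placeholder.

```python
import numpy as np

EPS = 1e-12  # guards against division by zero and log(0)


def weighted_beta_divergence(V, V_hat, Q, beta):
    """Sum of element-wise beta-divergences d(V | V_hat), each scaled by Q."""
    V = V + EPS
    V_hat = V_hat + EPS
    if beta == 1:    # generalized Kullback-Leibler divergence
        d = V * np.log(V / V_hat) - V + V_hat
    elif beta == 0:  # Itakura-Saito divergence
        d = V / V_hat - np.log(V / V_hat) - 1
    else:            # general case; beta == 2 gives half the squared error
        d = (V**beta + (beta - 1) * V_hat**beta
             - beta * V * V_hat**(beta - 1)) / (beta * (beta - 1))
    return float(np.sum(Q * d))


def weighted_nmf(V, Q, n_bases, beta=1.0, n_iter=200, seed=0):
    """Factorize V ~= W @ H under a Q-weighted beta-divergence.

    V : non-negative magnitude spectrogram, shape (n_freq, n_frames)
    Q : non-negative weights, broadcastable to V (hypothetical stand-in
        for perceptual weights; NOT the PEAQ model)
    """
    rng = np.random.default_rng(seed)
    n_freq, n_frames = V.shape
    W = rng.random((n_freq, n_bases)) + EPS    # bases (e.g. note spectra)
    H = rng.random((n_bases, n_frames)) + EPS  # encodings (activations)
    costs = []
    for _ in range(n_iter):
        # Multiplicative update of the encoding matrix H
        V_hat = W @ H + EPS
        H *= (W.T @ (Q * V * V_hat**(beta - 2))) / (W.T @ (Q * V_hat**(beta - 1)) + EPS)
        # Multiplicative update of the basis matrix W
        V_hat = W @ H + EPS
        W *= ((Q * V * V_hat**(beta - 2)) @ H.T) / ((Q * V_hat**(beta - 1)) @ H.T + EPS)
        costs.append(weighted_beta_divergence(V, W @ H, Q, beta))
    return W, H, costs


# Toy usage: random "spectrogram" and placeholder per-frequency weights
rng = np.random.default_rng(1)
V_demo = rng.random((20, 30)) + 0.1
Q_demo = np.linspace(1.0, 0.2, 20)[:, None]  # assumed weights, one per frequency bin
W_demo, H_demo, costs_demo = weighted_nmf(V_demo, Q_demo, n_bases=4, beta=1.0, n_iter=100)
```

Setting `Q` to all ones recovers unweighted beta-divergence NMF, so the weights only change how strongly each time-frequency bin influences the learned bases and encodings.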
