☆ 4.7 Article

Glance and gaze: A collaborative learning framework for single-channel speech enhancement

APPLIED ACOUSTICS (2022)

期刊

APPLIED ACOUSTICS

卷 187, 期 -, 页码 -

出版社

ELSEVIER SCI LTD

DOI: 10.1016/j.apacoust.2021.108499

关键词

Speech enhancement; glance and gaze; multi-stage; collaborative learning

类别

Acoustics

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

By proposing a collaborative learning framework for monaural noise suppression in the complex domain, the system achieves state-of-the-art performance through spectral feature extraction modules and stacked glance-gaze modules in experiments.

The capability of the human to pay attention to both coarse and fine-grained regions has been applied to computer vision tasks. Motivated by that, we propose a collaborative learning framework in the complex domain for monaural noise suppression. The proposed system consists of two principal modules, namely spectral feature extraction module (FEM) and stacked glance-gaze modules (GGMs). In FEM, the UNetblock is introduced after each convolution layer, enabling the feature recalibration from multiple scales. In each GGM, we decompose the multi-target optimization in the complex spectrum into two sub-tasks. Specifically, the glance path aims to suppress the noise in the magnitude domain to obtain a coarse estimation, and meanwhile, the gaze path attempts to compensate for the lost spectral detail in the complex domain. The two paths work collaboratively and facilitate spectral estimation from complementary perspectives. Besides, by repeatedly unfolding the GGMs, the intermediate result can be iteratively refined across stages and lead to the ultimate estimation of the spectrum. The experiments are conducted on the WSJ0-SI84, DNS-Challenge dataset, and Voicebank + Demand dataset. Results show that the proposed approach achieves state-of-the-art performance over previous advanced systems on the WSJ0-SI84, DNS Challenge, and Voicebank + Demand corpora. (c) 2021 Elsevier Ltd. All rights reserved.

Glance and gaze: A collaborative learning framework for single-channel speech enhancement

期刊

APPLIED ACOUSTICS

出版社

ELSEVIER SCI LTD

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Glance and gaze: A collaborative learning framework for single-channel speech enhancement

期刊

APPLIED ACOUSTICS

出版社

ELSEVIER SCI LTD

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文