☆ 4.6 Article

FSCNet: Feature-Specific Convolution Neural Network for Real-Time Speech Enhancement

IEEE SIGNAL PROCESSING LETTERS (2021)

期刊

IEEE SIGNAL PROCESSING LETTERS

卷 28, 期 -, 页码 1958-1962

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/LSP.2021.3113604

关键词

Convolution; Speech enhancement; Power capacitors; Kernel; Feature extraction; Time-frequency analysis; Decoding; Convolutional neural network; feature-specific convolution; speech enhancement

类别

Engineering, Electrical & Electronic

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

In this paper, a novel feature-specific convolutional neural network (FSCNet) is proposed for real-time speech enhancement. By dynamically parameterizing convolution kernels, leveraging long-term global contexts, and considering the importance of each feature channel, the network outperforms existing algorithms.

In recent years, convolutional neural networks (CNNs) have been widely exploited in deep neural network (DNN)-based speech enhancement methods. However, the representation power of CNNs for speech modeling is limited because of the spatial-agnostic convolution kernels. This letter proposes a novel feature-specific convolution neural network (FSCNet) for real-time speech enhancement. In FSCNet, the encoder and decoder are adopted for forward and inverse feature space transformation, respectively. The denoising module based on the feature-specific convolution (FSC) is employed to enhance the generated deep features. Leveraging the long-term global contexts and considering the importance of each feature channel for speech modeling, the convolution kernels of FSC are dynamically parameterized in each time-frequency location. A function-constrained loss is further proposed to train the FSCNet, ensuring the encoder, denoising modules and decoder can function as expected. Experimental results show that the proposed FSCNet outperforms the state-of-the-art denoising algorithms in terms of five objective evaluation metrics and model size.

FSCNet: Feature-Specific Convolution Neural Network for Real-Time Speech Enhancement

期刊

IEEE SIGNAL PROCESSING LETTERS

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

FSCNet: Feature-Specific Convolution Neural Network for Real-Time Speech Enhancement

期刊

IEEE SIGNAL PROCESSING LETTERS

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文