4.6 Article

FSCNet: Feature-Specific Convolution Neural Network for Real-Time Speech Enhancement

期刊

IEEE SIGNAL PROCESSING LETTERS
卷 28, 期 -, 页码 1958-1962

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/LSP.2021.3113604

关键词

Convolution; Speech enhancement; Power capacitors; Kernel; Feature extraction; Time-frequency analysis; Decoding; Convolutional neural network; feature-specific convolution; speech enhancement

向作者/读者索取更多资源

In this paper, a novel feature-specific convolutional neural network (FSCNet) is proposed for real-time speech enhancement. By dynamically parameterizing convolution kernels, leveraging long-term global contexts, and considering the importance of each feature channel, the network outperforms existing algorithms.
In recent years, convolutional neural networks (CNNs) have been widely exploited in deep neural network (DNN)-based speech enhancement methods. However, the representation power of CNNs for speech modeling is limited because of the spatial-agnostic convolution kernels. This letter proposes a novel feature-specific convolution neural network (FSCNet) for real-time speech enhancement. In FSCNet, the encoder and decoder are adopted for forward and inverse feature space transformation, respectively. The denoising module based on the feature-specific convolution (FSC) is employed to enhance the generated deep features. Leveraging the long-term global contexts and considering the importance of each feature channel for speech modeling, the convolution kernels of FSC are dynamically parameterized in each time-frequency location. A function-constrained loss is further proposed to train the FSCNet, ensuring the encoder, denoising modules and decoder can function as expected. Experimental results show that the proposed FSCNet outperforms the state-of-the-art denoising algorithms in terms of five objective evaluation metrics and model size.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据