4.7 Article

Bionic optimization of MFCC features based on speaker fast recognition

Journal

APPLIED ACOUSTICS
Volume 173, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.apacoust.2020.107682

Keywords

Adaptive endpoint detection; Bionic auditory curve; Improved Mel; Recognition filter; Voice signal

This research focuses on speech recognition in low SNR environments, utilizing the characteristics of the human auditory system and bionic technology to dynamically extract Mel features and enhance sound, resulting in improved accuracy and efficiency in speech recognition.
In low-SNR environments, making speaker recognition both faster and more accurate has become an active research topic. The human auditory system can accurately capture the characteristics of acoustic events in complex scenes or in low-SNR noise, which makes it an important model for research on bionic hearing. Bionic technology is used to obtain the output response curve of the human ear, which serves as the optimal response curve for sound enhancement and is used to modify the Mel filter. An adaptive threshold selection method is then used to integrate the Mel features, achieving reduction and dynamic extraction of low-SNR speech features. This method not only overcomes the poor robustness and complexity of parametric models, but also captures dynamic and comprehensive speech information from different speakers in different scenes. Finally, an improved CNN and an I-vector system are used to reduce the dimensionality of the data and to verify recognition, achieving optimal frequency-selective amplification and simplification of the acoustic signal. At an SNR of -5 dB, the model is reduced by 15% and recognition accuracy is improved by 3%. (C) 2020 Elsevier Ltd. All rights reserved.
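
The abstract does not spell out the adaptive threshold rule, so the following is only a generic illustration of adaptive endpoint detection, not the authors' method. It assumes a short-time-energy detector whose threshold is the estimated noise floor plus a fraction of the observed energy range. The function names, the frame sizes, and the parameters `noise_frames` and `alpha` are hypothetical choices made for this sketch.

```python
import math


def frame_energies(signal, frame_len, hop):
    """Short-time energy (sum of squared samples) of each full frame."""
    return [
        sum(x * x for x in signal[start:start + frame_len])
        for start in range(0, len(signal) - frame_len + 1, hop)
    ]


def adaptive_threshold(energies, noise_frames, alpha):
    """Assumed rule: noise floor plus a fraction of the energy range.

    The noise floor is estimated from the first `noise_frames` frames,
    which are assumed to contain no speech.
    """
    noise_floor = sum(energies[:noise_frames]) / noise_frames
    return noise_floor + alpha * (max(energies) - noise_floor)


def detect_endpoints(signal, frame_len=160, hop=80, noise_frames=5, alpha=0.1):
    """Return (first_frame, last_frame) whose energy exceeds the threshold.

    Returns None when the signal is too short or no frame is above the
    threshold.
    """
    energies = frame_energies(signal, frame_len, hop)
    if len(energies) < noise_frames:
        return None
    threshold = adaptive_threshold(energies, noise_frames, alpha)
    active = [i for i, e in enumerate(energies) if e > threshold]
    if not active:
        return None
    return active[0], active[-1]


if __name__ == "__main__":
    # Synthetic 8 kHz test signal: quiet noise, a 440 Hz tone, quiet noise.
    def noise(i):
        return 0.01 * ((i * 7919 % 13) / 13 - 0.5)

    def tone(i):
        return math.sin(2 * math.pi * 440 * i / 8000)

    signal = [tone(i) if 1600 <= i < 3200 else noise(i) for i in range(4800)]
    print(detect_endpoints(signal))
```

Frame indices map back to samples as `first_frame * hop` and `last_frame * hop + frame_len`. Because the threshold is recomputed from each recording's own noise floor, the same settings can be reused across recordings with different background levels, which is the general motivation for an adaptive rather than a fixed threshold.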

