4.6 Article

Distant-talking accent recognition by combining GMM and DNN

期刊

MULTIMEDIA TOOLS AND APPLICATIONS
卷 75, 期 9, 页码 5109-5124

出版社

SPRINGER
DOI: 10.1007/s11042-015-2935-4

关键词

Accent recognition; GMM; DNN; Distant-talking speech; Machine learning

资金

  1. JSPS KANKENHI [15K16020]
  2. Telecommunications Advancement Foundation (TAF), Japan
  3. Grants-in-Aid for Scientific Research [15K16020] Funding Source: KAKEN

向作者/读者索取更多资源

Recently, automatic accent recognition has been paid more and more attentions. However, there are few researches focusing on accent recognition in distant-talking environment which is very important for improving distant-talking speech recognition performance with non-native accents. In this paper, we apply Gaussian Mixture Models (GMM) and Deep Neural Network (DNN) to identify the speaker accent in reverberant environments. The combination of likelihood with these two approaches is also proposed. In reverberant environment, the accent recognition rate was improved from 90.7 % with GMM to 93.0 % with DNN. The combination of GMM and DNN achieved recognition rate of 97.5 %, which outperformed than the individual GMM and DNN because the complementation of GMM and DNN. The relative error reduction is 73.1 % than the GMM-based method and 64.3 % than the DNN-based method, respectively.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据