3.8 Article

A comparison of three spectral features for phone recognition in sub-optimal environments

期刊

出版社

INDERSCIENCE ENTERPRISES LTD
DOI: 10.1504/IJAPR.2018.092522

关键词

mel-frequency cepstrum coefficients; MFCC; perceptual linear prediction; PLP; linear prediction cepstral coefficients; LPCC; speech features; phonetic engine; hidden Markov model; HMM; HTK toolkit

向作者/读者索取更多资源

This paper presents a comparison of three spectral features for automatic phone recognition in sub-optimal environments. An exclusive study is carried out with a phone recognition system called phonetic engine (PE) developed in the Manipuri language. The Manipuri language is a scheduled Indian language being used as the official language in the State of Manipur. However, there is no standard database of the language so far. Therefore, a PE has been built for this language. Here phonetic transcriptions are done and then modeling of each phonetic unit is carried out using hidden Markov model (HMM). Speech feature extraction is a very important stage in the development of such a PE. An analysis of phone recognition accuracies of the PE due the three dominant spectral features: MFCC, PLP and LPCC have been studied here. It is found that PLP and MFCC outperform LPCC features under all circumstances.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据