4.6 Article

Combining evidence from residual phase and MFCC features for speaker recognition

期刊

IEEE SIGNAL PROCESSING LETTERS
卷 13, 期 1, 页码 52-55

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/LSP.2005.860538

关键词

autoassociative neural network; glottal closure instant; linear prediction (LP) residual; residual phase; speaker verification

向作者/读者索取更多资源

The objective of this letter is to demonstrate the complementary nature of speaker-specific information present in the residual,phase in comparison with the information present in the conventional mel-frequency cepstral coefficients (MFCCs). The residual phase is derived from speech signal by linear prediction analysis. Speaker recognition studies are conducted on the NIST-2003 database using the proposed residual phase and the existing MFCC features. The speaker recognition system based on the residual phase gives an equal error rate (EER) of 22%, and the system using the MFCC features gives an EER of 14%. By combining the evidence from both the residual phase and the MFCC features, an EER of 10.5% is obtained, indicating that speaker-specific excitation information is present in the residual phase. This information is useful since it is complementary to that of MFCCs.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据