Article

Speaker Verification by Vowel and Nonvowel Like Segmentation

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2013)

Journal

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING

Volume 21, Issue 4, Pages 854-867

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TASL.2013.2238529

Keywords

VLRs; non-VLRs; VLROP; VLREP; speaker information; speaker verification; degraded condition

Categories

Acoustics; Engineering, Electrical & Electronic

Funding

project titled Development of Speech based Multi-Level Person Authentication System
Department of Information Technology (DIT), New Delhi, India

Abstract

This work proposes methods for detecting vowel-like regions (VLRs) and non-vowel-like regions (non-VLRs) using excitation source information. The VLR onset and end points are hypothesized and used in an iterative algorithm for detecting the VLRs. Next, to detect non-VLRs, the linear prediction (LP) residual samples in the VLRs are attenuated significantly, which indirectly emphasizes the residual samples in the non-VLRs. The modified LP residual samples excite the time-varying all-pole filter to reconstruct non-VLR-enhanced speech, which is then used for detecting non-VLRs. The VLRs and non-VLRs are used independently during training and testing of a speaker verification (SV) system, both to reduce gross-level mismatch due to sound units and to better compensate for degradation effects by applying different normalization to these two different energy regions. Finally, the scores are combined with a higher weight on VLRs, which are more speaker specific. Experiments verify that the proposed approach provides improved performance for clean and degraded speech. On the NIST-2003 speaker recognition database, using VLRs and non-VLRs improves the equal error rate from 6.63% to 6% and from 2.29% to 1.89% for a GMM-UBM based and an i-vector based SV system, respectively.
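
The residual-attenuation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the attenuation factor, the LP order, or the framing, so `gain=0.1`, `order=8`, and all function names here are hypothetical. For brevity the sketch also uses a single fixed LP filter over the whole signal, where the paper uses a time-varying all-pole filter, and it assumes the VLR boundaries are already known rather than detected from onset and end points.

```python
import math
import random


def lp_coefficients(x, order):
    """Autocorrelation-method LP analysis via Levinson-Durbin.

    Returns a[0..order] with a[0] == 1, so that the residual is
    e[n] = sum_j a[j] * x[n - j].
    """
    n = len(x)
    r = [sum(x[i] * x[i + k] for i in range(n - k)) for k in range(order + 1)]
    a = [1.0] + [0.0] * order
    err = r[0]
    for m in range(1, order + 1):
        if err <= 0.0:  # degenerate input (e.g. all zeros): stop early
            break
        acc = r[m] + sum(a[j] * r[m - j] for j in range(1, m))
        k = -acc / err  # reflection coefficient
        prev = a[:]
        for j in range(1, m):
            a[j] = prev[j] + k * prev[m - j]
        a[m] = k
        err *= 1.0 - k * k
    return a


def lp_residual(x, a):
    """Inverse (all-zero) filtering with zero initial state."""
    return [sum(a[j] * x[n - j] for j in range(len(a)) if n - j >= 0)
            for n in range(len(x))]


def synthesize(e, a):
    """All-pole filtering: y[n] = e[n] - sum_{j>=1} a[j] * y[n - j]."""
    y = []
    for n in range(len(e)):
        y.append(e[n] - sum(a[j] * y[n - j]
                            for j in range(1, len(a)) if n - j >= 0))
    return y


def emphasize_non_vlr(x, vlr_spans, gain=0.1, order=8):
    """Attenuate the LP residual inside the VLR spans, then resynthesize.

    vlr_spans holds half-open sample ranges [start, end); gain and order
    are assumed values, not taken from the paper.
    """
    a = lp_coefficients(x, order)
    e = lp_residual(x, a)
    for start, end in vlr_spans:
        for n in range(start, min(end, len(e))):
            e[n] *= gain
    return synthesize(e, a)


def energy(x, start, end):
    """Sum of squared samples over [start, end)."""
    return sum(v * v for v in x[start:end])


# Toy signal: a strong periodic "vowel-like" stretch followed by weak noise.
rng = random.Random(0)
signal = [math.sin(0.3 * n) + 0.5 * math.sin(0.6 * n) + 0.01 * rng.gauss(0, 1)
          for n in range(200)]
signal += [0.05 * rng.gauss(0, 1) for _ in range(200)]
vlr_spans = [(0, 200)]

enhanced = emphasize_non_vlr(signal, vlr_spans, gain=0.1)
```

With `gain=1.0` the residual is left untouched and the synthesis filter reproduces the input, which is a quick sanity check on the analysis and synthesis pair. With a smaller gain, the energy inside the VLR span drops relative to the rest of the signal, which is the sense in which the non-VLRs are "indirectly emphasized".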
