Journal
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
Volume 19, Issue 8, Pages 2552-2565Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TASL.2011.2155061
Keywords
Degraded condition; speaker information; speaker verification (SV); vowel-like region (VLR); vowel-like region onset point
Categories
Funding
- Development of Person Authentication System based on Speaker Verification in Uncontrolled Environment
- Department of Information Technology (DIT), New Delhi, India
Ask authors/readers for more resources
Vowel-like regions (VLRs) in speech includes vowels, semi-vowels, and diphthong sound units. VLR can be identified using a vowel-like region onset point (VLROP) event. By production, the VLR has impulse-like excitation and therefore information about the vocal tract system may be better manifested in them. Also, the VLR is a relatively high signal-to-noise ratio (SNR) region. Speaker information extracted from such a region may therefore be more speaker discriminative and relatively less affected by the degradations like noise, reverberation, and sensor mismatches. Due to this, better speaker modeling and reliable testing may be possible. In this paper, VLRs are detected using the knowledge of VLROPs during training and testing. Features from the VLRs are then used for training and testing the speaker models. As a result, significant improvement in the performance is reported for speaker verification under degraded conditions.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available