☆ 4.4 Article

End Point Detection Using Speech-Specific Knowledge for Text-Dependent Speaker Verification

CIRCUITS SYSTEMS AND SIGNAL PROCESSING (2018)

Journal

CIRCUITS SYSTEMS AND SIGNAL PROCESSING

Volume 37, Issue 12, Pages 5507-5539

Publisher

SPRINGER BIRKHAUSER

DOI: 10.1007/s00034-018-0827-3

Keywords

End point detection; Vowel-like region; Glottal activity; Dominant resonant frequency; Foreground speech segmentation; Speech duration knowledge

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

This paper proposes a method using speech-specific knowledge to detect the begin and end points of speech under degraded condition. The method is based on vowel-like region (VLR) detection and uses both excitation source and vocal tract system information. Existing method for VLR detection uses excitation source information. Vocal tract system information from dominant resonant frequency is used to eliminate spurious VLRs in background noise. Foreground speech segmentation using excitation and vocal tract system information is carried out to remove spurious VLRs in the background speech region. Better localization of the end points is done using more detailed information about excitation source in terms of glottal activity to detect the sonorant consonants and missed VLRs. To include an unvoiced consonant, obstruent region detection is done at the beginning of the first VLR and at the end of last VLR. Detected begin and end points are evaluated by comparing with manually marked end points as well as by conducting the text-dependent speaker verification experiments. The proposed method performs better than some of the existing techniques.

End Point Detection Using Speech-Specific Knowledge for Text-Dependent Speaker Verification

Journal

CIRCUITS SYSTEMS AND SIGNAL PROCESSING

Publisher

SPRINGER BIRKHAUSER

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

End Point Detection Using Speech-Specific Knowledge for Text-Dependent Speaker Verification

Journal

CIRCUITS SYSTEMS AND SIGNAL PROCESSING

Publisher

SPRINGER BIRKHAUSER

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper