3.8 Article

A comparative study of different features for isolated spoken word recognition using HMM with reference to Assamese language

Journal

INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY
Volume 18, Issue 4, Pages 673-684

Publisher

SPRINGER
DOI: 10.1007/s10772-015-9311-7

Keywords

Speech recognition; Isolated word; Speaker independent; LPC; LPCEPSTRA; MELSPEC; MFCC; HMM; HTK; Assamese language

Ask authors/readers for more resources

This paper describes the work done in implementation of speaker independent, isolated word recognizer for Assamese language. Linear predictive coding (LPC) analysis, LPC cepstral coefficients (LPCEPSTRA), linear mel-filter bank channel outputs and mel frequency cepstral coefficients (MFCC) are used to get the acoustical features. The hidden Markov model toolkit (HTK) using the Hidden Markov Model (HMM) has been used to build the different recognition models. The speech recognition model is trained for 10 Assamese words representing the digits from 0 (shounya) to 9 (no) in the Assamese language using fifteen speakers. Different models were created for each word which varied on the number of input feature values and the number of hidden states. The system obtained a maximum accuracy of 80 % for 39 MFCC features and a 7 state HMM model with 5 hidden states for a system with clean data and a maximum accuracy of 95 % for 26 LPCESPTRA features and a 7 state HMM model with 5 hidden states for a system with noisy data.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available