4.7 Article

A Language Identification System using Hybrid Features and Back-Propagation Neural Network

Journal

APPLIED ACOUSTICS
Volume 164, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.apacoust.2020.107289

Keywords

Language identification; Hybrid feature extraction techniques; Mel-frequency cepstral coefficient; Feed-forward back propagation neural network

Categories

Ask authors/readers for more resources

Language Identification (LID) is accurate identification of the unknown language by comparison of speech biometrics of test speech sample and language models accumulated beforehand. This paper presents and encourages the use of hybrid robust feature extraction techniques for spoken language identification (LID) system. In the feature extraction stage, different techniques are applied individually such as Mel frequency cepstral coefficients (MFCCs), perceptual linear prediction features (PLP), relative perceptual linear prediction features (RASTA-PLP). Later, performance of our LID system based on several combinations of the different features (hybrid features) are investigated such as MFCC, PLP, combined with their 1st order derivatives, MFCC + RASTA-PLP, MFCC + SDC (Shifted delta cepstral coefficients). Language identification phase or classification utilizes feed forward back-propagation neural network (FFBPNN) and comparison is based on two learning algorithms: the Levenberg-Marquardt trainlm and the scaled conjugate gradient trainscg. A comparative analysis in terms of performance is done between different hybrid feature extraction techniques and their individual counterparts. Results clearly indicates that improved performance is obtained with hybrid features with trainlm learning algorithm as compared to their individual counterparts. The results are very promising with MFCC-RASTA-PLP hybrid feature extraction technique in comparison to the other hybrid feature extraction techniques with overall accuracy of 94.6% and a minimum test error rate of 0.10. The efficiency of proposed hybrid approaches is determined by simulating several experiments on a user defined language database of speech signals in the working platform of MATLAB. (C) 2020 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available