Speaker recognition with hybrid features from a deep belief network

NEURAL COMPUTING & APPLICATIONS (2018)

Journal

NEURAL COMPUTING & APPLICATIONS

Volume 29, Issue 6, Pages 13-19

Publisher

SPRINGER LONDON LTD

DOI: 10.1007/s00521-016-2501-7

Keywords

Deep belief networks; Deep learning; Mel-frequency cepstral coefficients

Category

Computer Science, Artificial Intelligence

Funding

Erasmus Mundus Strong Ties Grant
UK AHRC [AH/L01016X/1]
UK RAEng Research Fellowship [RF/128]
AHRC [AH/L01016X/1] Funding Source: UKRI

Abstract

Learning representations from audio data has shown advantages over handcrafted features such as mel-frequency cepstral coefficients (MFCCs) in many audio applications. In most representation learning approaches, connectionist systems are used to learn and extract latent features from fixed-length data. In this paper, we propose an approach that combines learned features with MFCC features for the speaker recognition task and that can be applied to audio scripts of different lengths. In particular, we study the use of features from different levels of a deep belief network (DBN) for quantizing the audio data into vectors of audio word counts. These vectors represent audio scripts of different lengths, which makes it easier to train a classifier. We show experimentally that the audio word count vectors generated from a mixture of DBN features at different layers perform better than the MFCC features. We can achieve further improvement by combining the audio word count vectors with the MFCC features.
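
The quantization step mentioned in the abstract can be illustrated with a minimal sketch. It shows only the general idea of turning a variable number of frame-level feature vectors into a fixed-size vector of audio word counts. Everything specific here is an assumption and is not taken from the paper: the function name `audio_word_counts`, the nearest-codeword assignment by Euclidean distance, the codebook size, and the random arrays that stand in for DBN-layer features.

```python
import numpy as np

def audio_word_counts(frames, codebook):
    """Assign each frame to its nearest codeword and count the assignments.

    frames:   (n_frames, dim) frame-level features (stand-ins for DBN features)
    codebook: (n_words, dim) audio words (hypothetical; e.g. cluster centres)
    """
    # Squared Euclidean distance between every frame and every codeword.
    dists = ((frames[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    words = dists.argmin(axis=1)  # index of the nearest audio word per frame
    # Histogram with one bin per audio word, so the length is always n_words.
    return np.bincount(words, minlength=len(codebook))

rng = np.random.default_rng(0)
codebook = rng.normal(size=(8, 5))       # assumed: 8 audio words, 5-dim features
short_script = rng.normal(size=(40, 5))  # an audio script with 40 frames
long_script = rng.normal(size=(250, 5))  # an audio script with 250 frames

short_vec = audio_word_counts(short_script, codebook)
long_vec = audio_word_counts(long_script, codebook)
print(short_vec.shape, long_vec.shape)
```

Both scripts map to vectors of the same length regardless of how many frames they contain, which is what allows a standard classifier to be trained on them.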
