Article

An unsupervised deep domain adaptation approach for robust speech recognition

NEUROCOMPUTING (2017)

Journal

NEUROCOMPUTING

Volume 257, Issue -, Pages 79-87

Publisher

ELSEVIER SCIENCE BV

DOI: 10.1016/j.neucom.2016.11.063

Keywords

Domain adaptation; Robust speech recognition; Deep neural network; Deep learning

Category

Computer Science, Artificial Intelligence

Funding

National Natural Science Foundation of China [61571363]
National High Technology Research and Development Program of China [2015AA016402]

Abstract

This paper addresses the robust speech recognition problem as a domain adaptation task. Specifically, we introduce an unsupervised deep domain adaptation (DDA) approach to acoustic modeling in order to eliminate the training-testing mismatch that is common in real-world use of speech recognition. Under a multi-task learning framework, the approach jointly learns two discriminative classifiers using one deep neural network (DNN). As the main task, a label predictor predicts phoneme labels and is used both during training and at test time. As the second task, a domain classifier discriminates between the source and the target domains during training. The network is optimized by minimizing the loss of the label predictor while simultaneously maximizing the loss of the domain classifier. The proposed approach is easy to implement by modifying a common feed-forward network. Moreover, this unsupervised approach needs only labeled training data from the source domain and some unlabeled raw data from the new domain. Speech recognition experiments on noise/channel distortion and domain shift confirm the effectiveness of the proposed approach. For instance, on the Aurora-4 corpus, the DDA approach achieves a 37.8% relative word error rate (WER) reduction compared with an acoustic model trained only on clean data. © 2017 Elsevier B.V. All rights reserved.
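
The abstract does not spell out the network or the update rule, so the following is only a minimal sketch of one common way to realize the "minimize the label loss, maximize the domain loss" objective: reversing the domain-loss gradient before it reaches the shared layers. It is not the authors' implementation. Everything here is an assumption made for illustration: a single tanh hidden layer, binary labels in place of phoneme classes, and the hypothetical names `dda_gradients`, `sgd_step`, and `lam` (the weight on the reversed gradient).

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def dda_gradients(params, x, domain, label=None, lam=1.0):
    """Per-frame gradients under an assumed gradient-reversal formulation.

    params: shared layer W (k x d), b (k); label head u (k), c;
            domain head v (k), e.
    domain: 0 for source, 1 for target.
    label:  0/1 for labeled source frames, None for unlabeled target frames.
    """
    W, b, u, c, v, e = (params[n] for n in ("W", "b", "u", "c", "v", "e"))
    # Forward pass through the shared hidden layer.
    h = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + bj)
         for row, bj in zip(W, b)]
    # Domain classifier: cross-entropy gradient w.r.t. its logit.
    p_dom = sigmoid(sum(vj * hj for vj, hj in zip(v, h)) + e)
    g_dom = p_dom - domain
    # Label predictor: only labeled (source) frames contribute.
    if label is None:
        g_lab = 0.0
    else:
        p_lab = sigmoid(sum(uj * hj for uj, hj in zip(u, h)) + c)
        g_lab = p_lab - label
    # Each head descends its own loss as usual.
    grads = {"u": [g_lab * hj for hj in h], "c": g_lab,
             "v": [g_dom * hj for hj in h], "e": g_dom}
    # Shared layer: label gradient unchanged, domain gradient scaled by -lam,
    # so a descent step on these values raises the domain loss.
    g_h = [g_lab * uj - lam * g_dom * vj for uj, vj in zip(u, v)]
    g_a = [gh * (1.0 - hj * hj) for gh, hj in zip(g_h, h)]  # tanh derivative
    grads["W"] = [[ga * xi for xi in x] for ga in g_a]
    grads["b"] = g_a
    return grads

def sgd_step(params, grads, lr=0.1):
    """Return new parameters after one plain gradient-descent step."""
    def step(p, g):
        if isinstance(p, list):
            return [step(pi, gi) for pi, gi in zip(p, g)]
        return p - lr * g
    return {name: step(params[name], grads[name]) for name in params}
```

In this sketch an unlabeled target frame leaves the label head untouched and updates only the domain head and the shared layer, which matches the abstract's claim that the new domain needs no labels. At test time only the shared layer and the label head would be used.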
