☆ 4.7 Article

Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees

PATTERN RECOGNITION (2010)

期刊

PATTERN RECOGNITION

卷 43, 期 4, 页码 1577-1589

出版社

ELSEVIER SCI LTD

DOI: 10.1016/j.patcog.2009.11.010

关键词

Sequence learning; EM algorithm; Wavelets; Speech recognition

类别

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic

资金

National Scientific and Technical Research Council (CONICET)
National Agency for the Promotion of Science and Technology [ANPCyT-UNL PICT 11-25984, ANPCyT-UNL PAE-PICT 52, ANPCyT-UNER PICT 11-12700]
National University of Litoral [UNL CAID 012-72, CAID II R 4-N 14]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Hidden Markov models have been found very useful for a wide range of applications in machine learning and pattern recognition. The wavelet transform has emerged as a new tool for signal and image analysis. Learning models for wavelet coefficients have been mainly based on fixed-length sequences, but real applications often require to model variable-length, very long or real-time sequences. In this paper, we propose a new learning architecture for sequences analyzed on short-term basis, but not assuming stationarity within each frame. Long-term dependencies will be modeled with a hidden Markov model which, in each internal state, will deal with the local dynamics in the wavelet domain, using a hidden Markov tree. The training algorithms for all the parameters in the composite model are developed using the expectation-maximization framework. This novel learning architecture could be useful for a wide range of applications. We detail two experiments with artificial and real data: model-based denoising and speech recognition. Denoising results indicate that the proposed model and learning algorithm are more effective than previous approaches based on isolated hidden Markov trees. In the case of the 'Doppler' benchmark sequence, with 1024 samples and additive white noise, the new method reduced the mean squared error from 1.0 to 0.0842. The proposed methods for feature extraction, modeling and learning, increased the phoneme recognition rates in 28.13%, with better convergence than models based on Gaussian mixtures. (C) 2009 Elsevier Ltd. All rights reserved.

Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees

期刊

PATTERN RECOGNITION

出版社

ELSEVIER SCI LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees

期刊

PATTERN RECOGNITION

出版社

ELSEVIER SCI LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文