Article

Augmented training of hidden Markov models to recognize remote homologs via simulated evolution

Journal

BIOINFORMATICS
Volume 25, Issue 13, Pages 1602-1608

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btp265

Funding

  1. National Institutes of Health [1R01GM080330-01A1]

Motivation: Profile hidden Markov models (HMMs) are a powerful and successful method for recognizing homologous proteins, but they can break down when the homology is too distant, because too little training data is available. We show that the performance of HMMs in this regime can be improved by using a simple simulated model of evolution to create an augmented training set.

Results: On two different remote protein homology tasks, HMMs whose training is augmented with simulated evolution outperform HMMs trained only on real data. A mutation rate between 15 and 20% performs best, both for recognizing G-protein-coupled receptor proteins across classes and for recognizing SCOP superfamily proteins across families.
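The abstract does not spell out the evolution model, so the following is only a minimal sketch of the general idea, not the authors' procedure. It assumes that "mutation rate" means an independent per-residue substitution probability and that replacements are drawn uniformly from the 20 standard amino acids; a more realistic model might draw replacements from a substitution matrix instead. The names `mutate`, `augment`, `copies`, and `seed` are hypothetical.

```python
import random

# The 20 standard amino acids (one-letter codes).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def mutate(sequence, rate, rng):
    """Return a copy of `sequence` with each residue substituted with probability `rate`.

    Assumption: substitutions are uniform over the other standard amino acids.
    """
    mutated = []
    for residue in sequence:
        if rng.random() < rate:
            # Pick a replacement that differs from the current residue.
            choices = [aa for aa in AMINO_ACIDS if aa != residue]
            mutated.append(rng.choice(choices))
        else:
            mutated.append(residue)
    return "".join(mutated)


def augment(sequences, rate=0.15, copies=3, seed=0):
    """Return the real sequences followed by `copies` simulated mutants of each."""
    rng = random.Random(seed)
    augmented = list(sequences)
    for seq in sequences:
        for _ in range(copies):
            augmented.append(mutate(seq, rate, rng))
    return augmented
```

Under these assumptions, a call such as `augment(real_sequences, rate=0.2)` would yield the original sequences plus three mutated copies of each, and the combined set could then be aligned and passed to whatever profile HMM training tool is in use.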
