☆ 4.8 Article

Phymm and PhymmBL: metagenomic phylogenetic classification with interpolated Markov models

NATURE METHODS (2009)

期刊

NATURE METHODS

卷 6, 期 9, 页码 673-U68

出版社

NATURE PUBLISHING GROUP

DOI: 10.1038/NMETH.1358

关键词

类别

Biochemical Research Methods

资金

US National Institutes of Health [R01-LM006845, R01-GM083873]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Metagenomics projects collect DNA from uncharacterized environments that may contain thousands of species per sample. One main challenge facing metagenomic analysis is phylogenetic classification of raw sequence reads into groups representing the same or similar taxa, a prerequisite for genome assembly and for analyzing the biological diversity of a sample. New sequencing technologies have made metagenomics easier, by making sequencing faster, and more difficult, by producing shorter reads than previous technologies. Classifying sequences from reads as short as 100 base pairs has until now been relatively inaccurate, requiring researchers to use older, long-read technologies. We present Phymm, a classifier for metagenomic data, that has been trained on 539 complete, curated genomes and can accurately classify reads as short as 100 base pairs, a substantial improvement over previous composition-based classification methods. We also describe how combining Phymm with sequence alignment algorithms improves accuracy.

Phymm and PhymmBL: metagenomic phylogenetic classification with interpolated Markov models

期刊

NATURE METHODS

出版社

NATURE PUBLISHING GROUP

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Phymm and PhymmBL: metagenomic phylogenetic classification with interpolated Markov models

期刊

NATURE METHODS

出版社

NATURE PUBLISHING GROUP

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文