4.0 Article

mOTUpan: a robust Bayesian approach to leverage metagenome-assembled genomes for core-genome estimation

期刊

NAR GENOMICS AND BIOINFORMATICS
卷 4, 期 3, 页码 -

出版社

OXFORD UNIV PRESS
DOI: 10.1093/nargab/lqac060

关键词

-

资金

  1. Swedish Research Council [2017-04422, 2018-04685]
  2. Vinnova [2018-04685] Funding Source: Vinnova
  3. Swedish Research Council [2018-04685, 2017-04422] Funding Source: Swedish Research Council

向作者/读者索取更多资源

Recent advances in sequencing and bioinformatics have expanded the tree of life by providing genomes for uncultured environmentally relevant clades. mOTUpan is a novel method for computing the core genome of highly diverse genome sets. The core-genome prediction of mOTUpan is computationally efficient and can be applied to genomes with lower completeness thresholds.
Recent advances in sequencing and bioinformatics have expanded the tree of life by providing genomes for uncultured environmentally relevant clades, either through metagenome-assembled genomes or through single-cell genomes. While this expanded diversity can provide novel insights into microbial population structure, most tools available for coregenome estimation are sensitive to genome completeness. Consequently, a major portion of the huge phylogenetic diversity uncovered by environmental genomic approaches remains excluded from such analyses. We present mOTUpan, a novel iterative Bayesian method for computing the core genome for sets of genomes of highly diverse completeness range. The likelihood for each gene cluster to belong to core or accessory genome is estimated by computing the probability of its presence/absence pattern in the target genome set. The core-genome prediction is computationally efficient and can be scaled up to thousands of genomes. It has shown comparable estimates to state-of-the-art tools Roary and PPanGGOLiN for high-quality genomes and is capable of using genomes at lower completeness thresholds. mOTUpan wraps a bootstrapping procedure to estimate the quality of a specific core-genome prediction, as the accuracy of each run will depend on the specific completeness distribution and the number of genomes in the dataset under scrutiny. mOTUpan is implemented in the mOTUlizer software package, and available at github.com/moritzbuck/mOTUlizer, under GPL 3.0 license.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.0
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据