4.7 Article

PaSiT: a novel approach based on short-oligonucleotide frequencies for efficient bacterial identification and typing

Journal

BIOINFORMATICS
Volume 36, Issue 8, Pages 2337-2344

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btz964


Funding

  1. BCCM/LMG Bacteria Collection
  2. Belgian Nuclear Research Centre (SCK_CEN)
  3. Federal Public Planning Service-Science Policy, Belgium


Motivation: One of the most widespread methods used in taxonomy studies to distinguish between strains or taxa is the calculation of average nucleotide identity. It requires a computationally expensive alignment step and is therefore not suitable for large-scale comparisons. Short oligonucleotide-based methods offer a faster alternative, but at the expense of accuracy. Here, we aim to address this shortcoming by providing software that implements a novel method based on short-oligonucleotide frequencies to compute inter-genomic distances.

Results: Our tetranucleotide and hexanucleotide implementations, which were optimized based on a taxonomically well-defined set of over 200 newly sequenced bacterial genomes, are as accurate as the short oligonucleotide-based method TETRA and average nucleotide identity, for identifying bacterial species and strains, respectively. Moreover, the lightweight nature of this method makes it applicable for large-scale analyses.
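To illustrate the general idea of an alignment-free distance based on short-oligonucleotide frequencies, the following is a minimal Python sketch. It is not the PaSiT algorithm, whose details are not given in this abstract. The function names `kmer_frequencies` and `correlation_distance` are hypothetical, and the choice of one minus the Pearson correlation as the distance is an assumption made only for illustration.

```python
from itertools import product
from math import sqrt


def kmer_frequencies(seq, k=4):
    """Return relative frequencies of all 4**k k-mers in seq.

    Illustrative helper (hypothetical name). Windows containing
    characters other than A, C, G, T (for example N) are skipped.
    """
    seq = seq.upper()
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    for i in range(len(seq) - k + 1):
        word = seq[i:i + k]
        if word in counts:
            counts[word] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no valid k-mers")
    # Fixed lexicographic order so vectors from different genomes align.
    return [counts[w] / total for w in kmers]


def correlation_distance(p, q):
    """Return 1 - Pearson correlation of two frequency vectors.

    Assumed distance for this sketch only; not taken from the paper.
    """
    n = len(p)
    mean_p, mean_q = sum(p) / n, sum(q) / n
    cov = sum((a - mean_p) * (b - mean_q) for a, b in zip(p, q))
    sd_p = sqrt(sum((a - mean_p) ** 2 for a in p))
    sd_q = sqrt(sum((b - mean_q) ** 2 for b in q))
    if sd_p == 0 or sd_q == 0:
        raise ValueError("constant frequency vector")
    return 1.0 - cov / (sd_p * sd_q)


if __name__ == "__main__":
    # Toy sequences standing in for genomes.
    a = kmer_frequencies("ACGTACGTACGGTTAACC", k=4)
    b = kmer_frequencies("AAAAAAAACCCCCCCCGGGG", k=4)
    print(correlation_distance(a, b))
```

Because each genome is reduced to a fixed-length vector (256 entries for k=4, 4096 for k=6), a pairwise comparison costs time proportional to the vector length rather than to the genome length, which is why approaches of this kind scale to large collections.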

