Article

bioOTU: An Improved Method for Simultaneous Taxonomic Assignments and Operational Taxonomic Units Clustering of 16s rRNA Gene Sequences

Journal

JOURNAL OF COMPUTATIONAL BIOLOGY
Volume 23, Issue 4, Pages 229-238

Publisher

MARY ANN LIEBERT, INC
DOI: 10.1089/cmb.2015.0214

Keywords

16s rRNA; operational taxonomic units; next-generation sequencing

Funding

  1. National Natural Science Foundation of China [31172197]
  2. Sichuan Province [2015R20026]

Clustering 16s rRNA amplicon sequences into operational taxonomic units (OTUs) is the most common bioinformatics approach for investigating microbial communities with high-throughput sequencing technologies. However, the reliability of existing OTU clustering algorithms still needs improvement. Here we propose an improved method (bioOTU) that first assigns taxonomy to unique tags at the genus level, separating error-free sequences of known species in the reference database from artifacts, and then clusters them into OTUs using different strategies. The remaining tags, which fail to be clustered in the previous step, are then subjected to independent OTU clustering by an optimized heuristic clustering algorithm. Performance tests on both mock and real communities showed that bioOTU is effective at recovering the underlying profiles of both microbial composition and abundance, and that it produces a comparable or smaller number of OTUs than the prevailing tools Mothur and UPARSE. bioOTU is implemented in C and Python, with source code freely available in a GitHub repository.
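
The two-stage flow described in the abstract can be illustrated with a minimal Python sketch. This is not the bioOTU source code: the function names, the `reference` dictionary (an exact sequence-to-genus lookup), the one-OTU-per-genus simplification, and the ungapped `identity` measure are all assumptions made for illustration. A real pipeline would use a taxonomic classifier and alignment-based identity.

```python
from collections import Counter


def dereplicate(reads):
    """Collapse identical reads into unique tags, most abundant first."""
    return Counter(reads).most_common()


def identity(a, b):
    """Ungapped positional identity (assumed stand-in for a real alignment)."""
    if not a or not b:
        return 0.0
    matches = sum(x == y for x, y in zip(a, b))
    return matches / max(len(a), len(b))


def cluster(reads, reference, threshold=0.97):
    """Return a dict mapping OTU label -> total read count.

    `reference` is a hypothetical dict of exact sequence -> genus name.
    """
    otus = {}
    leftovers = []

    # Stage 1: tags with a known genus are binned by that genus
    # (a simplification; the paper mentions "different strategies").
    for tag, count in dereplicate(reads):
        genus = reference.get(tag)
        if genus is None:
            leftovers.append((tag, count))
        else:
            otus[genus] = otus.get(genus, 0) + count

    # Stage 2: greedy heuristic clustering of the remaining tags.
    # Tags arrive in decreasing abundance, so abundant tags seed centroids.
    centroids = []  # each entry is [centroid_sequence, read_count]
    for tag, count in leftovers:
        for centroid in centroids:
            if identity(tag, centroid[0]) >= threshold:
                centroid[1] += count
                break
        else:
            centroids.append([tag, count])

    for i, (_, count) in enumerate(centroids, start=1):
        otus[f"denovo_{i}"] = count
    return otus
```

Processing tags in order of decreasing abundance is a common design choice in greedy OTU clustering, since abundant tags are more likely to be error-free and therefore make better centroids. Whether bioOTU's optimized heuristic follows the same ordering is not stated in the abstract.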
