Article

Optimizing taxonomic classification of marker-gene amplicon sequences with QIIME 2's q2-feature-classifier plugin

Journal

MICROBIOME
Volume 6

Publisher

BMC
DOI: 10.1186/s40168-018-0470-z

Funding

  1. National Science Foundation [1565100]
  2. Alfred P. Sloan Foundation
  3. Partnership for Native American Cancer Prevention (NIH/NCI) [U54CA143924, U54CA143925]
  4. National Health and Medical Research Council of Australia [APP1085372]
  5. Directorate for Biological Sciences
  6. Division of Biological Infrastructure [1565100] (funding source: National Science Foundation)

Background: Taxonomic classification of marker-gene sequences is an important step in microbiome analysis.

Results: We present q2-feature-classifier (https://github.com/qiime2/q2-feature-classifier), a QIIME 2 plugin containing several novel machine-learning and alignment-based methods for taxonomy classification. We evaluated and optimized several commonly used classification methods implemented in QIIME 1 (RDP, BLAST, UCLUST, and SortMeRNA) and several new methods implemented in QIIME 2 (a scikit-learn naive Bayes machine-learning classifier, and alignment-based taxonomy consensus methods based on VSEARCH and BLAST+) for classification of bacterial 16S rRNA and fungal ITS marker-gene amplicon sequence data. The naive Bayes, BLAST+-based, and VSEARCH-based classifiers implemented in QIIME 2 meet or exceed the species-level accuracy of the other commonly used marker-gene classification methods evaluated in this work. These evaluations, based on 19 mock communities and error-free sequence simulations, including classification of simulated novel marker-gene sequences, are available in our extensible benchmarking framework, tax-credit (https://github.com/caporaso-lab/tax-credit-data).

Conclusions: Our results illustrate the importance of parameter tuning for optimizing classifier performance, and we make recommendations regarding parameter choices for these classifiers under a range of standard operating conditions. q2-feature-classifier and tax-credit are both free, open-source, BSD-licensed packages available on GitHub.
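
To make the idea of a naive Bayes sequence classifier concrete, here is a minimal sketch in plain Python. It is not the plugin's implementation (the abstract states that the plugin builds on scikit-learn); it is a toy multinomial naive Bayes over k-mer counts. The class name `KmerNaiveBayes`, the parameters `k` and `alpha`, the example sequences, and the taxon labels are all hypothetical and chosen only for illustration.

```python
import math
from collections import Counter, defaultdict


def kmers(seq, k=4):
    """Return the overlapping k-mers of a sequence."""
    seq = seq.upper()
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


class KmerNaiveBayes:
    """Toy multinomial naive Bayes over k-mer counts (illustration only)."""

    def __init__(self, k=4, alpha=1.0):
        self.k = k          # k-mer length (hypothetical default)
        self.alpha = alpha  # additive smoothing constant (hypothetical default)

    def fit(self, seqs, labels):
        self.counts = defaultdict(Counter)   # per-taxon k-mer counts
        self.class_totals = Counter()        # total k-mers seen per taxon
        self.class_docs = Counter(labels)    # reference sequences per taxon
        self.vocab = set()
        for seq, label in zip(seqs, labels):
            for km in kmers(seq, self.k):
                self.counts[label][km] += 1
                self.class_totals[label] += 1
                self.vocab.add(km)
        self.n_docs = len(labels)
        return self

    def log_scores(self, seq):
        """Unnormalized log posterior for each taxon given a query sequence."""
        vocab_size = len(self.vocab)
        scores = {}
        for label in self.class_docs:
            score = math.log(self.class_docs[label] / self.n_docs)  # log prior
            denom = self.class_totals[label] + self.alpha * vocab_size
            for km in kmers(seq, self.k):
                if km in self.vocab:  # skip k-mers never seen in training
                    score += math.log((self.counts[label][km] + self.alpha) / denom)
            scores[label] = score
        return scores

    def predict(self, seq):
        scores = self.log_scores(seq)
        return max(scores, key=scores.get)


# Made-up reference sequences and labels, for illustration only.
reference_seqs = [
    "ACGTACGTACGTACGT",
    "ACGTACGAACGTACGT",
    "TTGGCCAATTGGCCAA",
    "TTGGCCTATTGGCCAA",
]
reference_taxa = ["taxon_A", "taxon_A", "taxon_B", "taxon_B"]

model = KmerNaiveBayes(k=4, alpha=1.0).fit(reference_seqs, reference_taxa)
prediction = model.predict("ACGTACGTACGA")
```

In this sketch, `k` and `alpha` are the kind of settings that the parameter tuning discussed in the conclusions would adjust; the values used here are arbitrary.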
