Rapid identification of high-confidence taxonomic assignments for metagenomic data

Journal

NUCLEIC ACIDS RESEARCH
Volume 40, Issue 14

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/nar/gks335

Funding

  1. Natural Sciences and Engineering Research Council of Canada
  2. Killam Trusts
  3. Genome Atlantic
  4. Canada Foundation for Innovation
  5. Canada Research Chairs program
  6. Government of Canada through Genome Canada
  7. Ontario Genomics Institute [2009-OGI-ABC-1405]
  8. Canadian Natural Sciences and Engineering Research Council

Determining the taxonomic lineage of DNA sequences is an important step in metagenomic analysis. Short DNA fragments from next-generation sequencing projects and microbes that lack close relatives in databases of sequenced reference genomes pose significant problems for taxonomic attribution methods. Our new classification algorithm, RITA (Rapid Identification of Taxonomic Assignments), uses the agreement between composition and homology to accurately classify sequences as short as 50 nt in length by assigning them to different classification groups with varying degrees of confidence. RITA is much faster than the hybrid PhymmBL approach when comparable homology search algorithms are used, and achieves slightly better accuracy than PhymmBL on an artificial metagenome. RITA can also incorporate prior knowledge about taxonomic distributions to increase the accuracy of assignments in data sets with varying degrees of taxonomic novelty, and classifies sequences with higher precision than the current best rank-flexible classifier. The accuracy on short reads can be increased by exploiting paired-end information, if available, which we demonstrate on a recently published bovine rumen data set. Finally, we develop a variant of RITA that incorporates accelerated homology search techniques, and generate predictions on a set of human gut metagenomes that were previously assigned to different 'enterotypes'. RITA is freely available in Web server and standalone versions.
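
The idea of grouping reads by whether composition and homology agree can be illustrated with a minimal sketch. This is not RITA's implementation: the function name `assign_group`, the group names, and the `strong_evalue` cutoff are all hypothetical, chosen only to show how agreement between two independent predictors might translate into confidence tiers.

```python
from typing import Optional, Tuple


def assign_group(
    composition_label: Optional[str],
    homology_label: Optional[str],
    homology_evalue: Optional[float],
    strong_evalue: float = 1e-10,  # hypothetical cutoff, not taken from the paper
) -> Tuple[Optional[str], str]:
    """Return (label, group) for one read from two independent predictions.

    Group names are illustrative only:
      "agreement"        - composition and homology predict the same taxon
      "homology_only"    - they disagree, but the homology hit is strong
      "composition_only" - fall back to the composition prediction
      "unclassified"     - no usable prediction
    """
    # Highest confidence: both lines of evidence point to the same taxon.
    if homology_label is not None and homology_label == composition_label:
        return homology_label, "agreement"

    # Next: a homology hit whose E-value passes the (assumed) cutoff.
    if (
        homology_label is not None
        and homology_evalue is not None
        and homology_evalue <= strong_evalue
    ):
        return homology_label, "homology_only"

    # Otherwise rely on composition alone, if it produced a label.
    if composition_label is not None:
        return composition_label, "composition_only"

    return None, "unclassified"


if __name__ == "__main__":
    print(assign_group("Bacteroides", "Bacteroides", 1e-3))
    print(assign_group("Prevotella", "Bacteroides", 1e-30))
    print(assign_group("Prevotella", "Bacteroides", 1e-2))
    print(assign_group(None, None, None))
```

Downstream analyses could then keep only the groups whose expected precision is acceptable for the question at hand, for example restricting abundance estimates to reads in the agreement group.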
