4.8 Article

Emu: species-level microbial community profiling of full-length 16S rRNA Oxford Nanopore sequencing data

Journal

NATURE METHODS
Volume 19, Issue 7, Pages 845-+

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41592-022-01520-4

Keywords

-

Funding

  1. Jurgen Manchot Foundation
  2. Deutsche Forschungsgemeinschaft (DFG) [428994620]
  3. NIH grant from NIDDK [P30-DK56338]
  4. NIH from the National Institute for Neurological Disorders and Stroke (NINDS) [R21NS106640]
  5. Ken Kennedy Institute Computational Science and Engineering Graduate Recruiting Fellowship
  6. NIH from the National Institute of Allergy and Infectious Diseases (NIAID) [P01-AI152999]
  7. NSF [EF-2126387]
  8. National Library of Medicine Training Program in Biomedical Informatics and Data Science [T15LM007093]
  9. NIH grant from NIAID [P01-AI152999, NIAID R01-AI10091401, U01-AI24290]
  10. NIH grant from NINR [R01-NR013497]

Ask authors/readers for more resources

16S ribosomal RNA-based analysis is the standard method for studying microbial community composition. Full-length 16S rRNA gene sequences have the potential to provide species-level accuracy. Emu is an approach that can generate taxonomic abundance profiles from full-length 16S rRNA reads, accurately estimating microbial abundance.
16S ribosomal RNA-based analysis is the established standard for elucidating the composition of microbial communities. While short-read 16S rRNA analyses are largely confined to genus-level resolution at best, given that only a portion of the gene is sequenced, full-length 16S rRNA gene amplicon sequences have the potential to provide species-level accuracy. However, existing taxonomic identification algorithms are not optimized for the increased read length and error rate often observed in long-read data. Here we present Emu, an approach that uses an expectation-maximization algorithm to generate taxonomic abundance profiles from full-length 16S rRNA reads. Results produced from simulated datasets and mock communities show that Emu is capable of accurate microbial community profiling while obtaining fewer false positives and false negatives than alternative methods. Additionally, we illustrate a real-world application of Emu by comparing clinical sample composition estimates generated by an established whole-genome shotgun sequencing workflow with those returned by full-length 16S rRNA gene sequences processed with Emu. Emu accurately estimates microbial abundance using full-length Nanopore 16S rRNA gene sequencing data.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available