4.7 Article

Optimizing UniFrac with OpenACC Yields Greater Than One Thousand Times Speed Increase

Journal

MSYSTEMS
Volume 7, Issue 3, Pages -

Publisher

AMER SOC MICROBIOLOGY
DOI: 10.1128/msystems.00028-22

Keywords

microbiome; GPU; OpenACC; optimization; UniFrac

Categories

Funding

  1. U.S. National Science Foundation (NSF) [DBI-2038509, OAC-1826967, OAC-1541349, CNS-1730158]
  2. U.S. National Institutes of Health (NIH) [DP1-AT010885]
  3. CRISP, one of six centers in JUMP, a Semiconductor Research Corporation (SRC) program - DARPA [GI18518]

Ask authors/readers for more resources

UniFrac is a widely used tool in microbiome research for comparing microbiome profiles. This study adapts UniFrac to be used on graphics processing units, resulting in a significant improvement in computational performance. The tool is successfully applied to the largest 16S rRNA V4 microbiome dataset analyzed to date.
UniFrac is an important tool in microbiome research that is used for phylogenetically comparing microbiome profiles to one another (beta diversity). Striped UniFrac recently added the ability to split the problem into many independent subproblems, exhibiting nearly linear scaling but suffering from memory contention. Here, we adapt UniFrac to graphics processing units using OpenACC, enabling greater than 1,000x computational improvement, and apply it to 307,237 samples, the largest 16S rRNA V4 uniformly preprocessed microbiome data set analyzed to date. IMPORTANCE UniFrac is an important tool in microbiome research that is used for phylogenetically comparing microbiome profiles to one another. Here, we adapt UniFrac to operate on graphics processing units, enabling a 1,000x computational improvement. To highlight this advance, we perform what may be the largest microbiome analysis to date, applying UniFrac to 307,237 16S rRNA V4 microbiome samples preprocessed with Deblur. These scaling improvements turn UniFrac into a real-time tool for common data sets and unlock new research questions as more microbiome data are collected.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available