4.7 Article

Principal microbial groups: compositional alternative to phylogenetic grouping of microbiome data

Journal

BRIEFINGS IN BIOINFORMATICS
Volume 23, Issue 5, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbac328

Keywords

microbiome; compositional data; balance; microbial biomarkers

Funding

  1. Scientific and Technological Research Council of Turkey [1059B141601395]
  2. Spanish Ministry of Science, Innovation and Universities
  3. European Regional

Ask authors/readers for more resources

This study presents a novel approach that groups Operational Taxonomical Units (OTUs) based on relative abundances using principal balances, providing an alternative to taxon grouping. The proposed method has potential applications in dimensionality reduction and construction of microbial balances for disease prediction, offering a coherent data analysis for biomarker discovery in human microbiota.
Statistical and machine learning techniques based on relative abundances have been used to predict health conditions and to identify microbial biomarkers. However, high dimensionality, sparsity and the compositional nature of microbiome data represent statistical challenges. On the other hand, the taxon grouping allows summarizing microbiome abundance with a coarser resolution in a lower dimension, but it presents new challenges when correlating taxa with a disease. In this work, we present a novel approach that groups Operational Taxonomical Units (OTUs) based only on relative abundances as an alternative to taxon grouping. The proposed procedure acknowledges the compositional data making use of principal balances. The identified groups are called Principal Microbial Groups (PMGs). The procedure reduces the need for user-defined aggregation of (OTUs) and offers the possibility of working with coarse group of OTUs, which are not present in a phylogenetic tree. PMGs can be used for two different goals: (1) as a dimensionality reduction method for compositional data, (2) as an aggregation procedure that provides an alternative to taxon grouping for construction of microbial balances afterward used for disease prediction. We illustrate the procedure with a cirrhosis study data. PMGs provide a coherent data analysis for the search of biomarkers in human microbiota. The source code and demo data for PMGs are available at: https://github.com/asliboyraz/PMGs.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available