4.7 Article

Principal microbial groups: compositional alternative to phylogenetic grouping of microbiome data

期刊

BRIEFINGS IN BIOINFORMATICS
卷 23, 期 5, 页码 -

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbac328

关键词

microbiome; compositional data; balance; microbial biomarkers

资金

  1. Scientific and Technological Research Council of Turkey [1059B141601395]
  2. Spanish Ministry of Science, Innovation and Universities
  3. European Regional

向作者/读者索取更多资源

This study presents a novel approach that groups Operational Taxonomical Units (OTUs) based on relative abundances using principal balances, providing an alternative to taxon grouping. The proposed method has potential applications in dimensionality reduction and construction of microbial balances for disease prediction, offering a coherent data analysis for biomarker discovery in human microbiota.
Statistical and machine learning techniques based on relative abundances have been used to predict health conditions and to identify microbial biomarkers. However, high dimensionality, sparsity and the compositional nature of microbiome data represent statistical challenges. On the other hand, the taxon grouping allows summarizing microbiome abundance with a coarser resolution in a lower dimension, but it presents new challenges when correlating taxa with a disease. In this work, we present a novel approach that groups Operational Taxonomical Units (OTUs) based only on relative abundances as an alternative to taxon grouping. The proposed procedure acknowledges the compositional data making use of principal balances. The identified groups are called Principal Microbial Groups (PMGs). The procedure reduces the need for user-defined aggregation of (OTUs) and offers the possibility of working with coarse group of OTUs, which are not present in a phylogenetic tree. PMGs can be used for two different goals: (1) as a dimensionality reduction method for compositional data, (2) as an aggregation procedure that provides an alternative to taxon grouping for construction of microbial balances afterward used for disease prediction. We illustrate the procedure with a cirrhosis study data. PMGs provide a coherent data analysis for the search of biomarkers in human microbiota. The source code and demo data for PMGs are available at: https://github.com/asliboyraz/PMGs.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据