4.8 Article

Statistical modeling of gut microbiota for personalized health status monitoring

Journal

MICROBIOME
Volume 11, Issue 1, Pages -

Publisher

BMC
DOI: 10.1186/s40168-023-01614-x

Keywords

Personalized health prediction; Gut microbiome; Principal component analysis; Statistical inference; Machine learning

Categories

Ask authors/readers for more resources

The study developed a statistical monitoring framework to predict and analyze individual health status based on the gut microbiome. It found that the gut microbiome is closely associated with health and identified the contribution of each bacterium to health. The framework can help to understand healthy microbiomes, unhealthy variations, and discover personalized therapy targets.
Background The gut microbiome is closely associated with health status, and any microbiota dysbiosis could considerably impact the host's health. In addition, many active consortium projects have generated many reference datasets available for large-scale retrospective research. However, a comprehensive monitoring framework that analyzes health status and quantitatively present bacteria-to-health contribution has not been thoroughly investigated. Methods We systematically developed a statistical monitoring diagram for personalized health status prediction and analysis. Our framework comprises three elements: (1) a statistical monitoring model was established, the health index was constructed, and the health boundary was defined; (2) healthy patterns were identified among healthy people and analyzed using contrast learning; (3) the contribution of each bacterium to the health index of the diseased population was analyzed. Furthermore, we investigated disease proximity using the contribution spectrum and discovered multiple multi-disease-related targets. Results We demonstrated and evaluated the effectiveness of the proposed monitoring framework for tracking personalized health status through comprehensive real-data analysis using the multi-study cohort and another validation cohort. A statistical monitoring model was developed based on 92 microbial taxa. In both the discovery and validation sets, our approach achieved balanced accuracies of 0.7132 and 0.7026, and AUC of 0.80 and 0.76, respectively. Four health patterns were identified in healthy populations, highlighting variations in species composition and metabolic function across these patterns. Furthermore, a reasonable correlation was found between the proposed health index and host physiological indicators, diversity, and functional redundancy. The health index significantly correlated with Shannon diversity ( rho = 0.07) and species richness ( rho = 0.44) in the healthy samples. However, in samples from individuals with diseases, the health index significantly correlated with age ( rho = 0.12), species richness ( rho = 0.46), and functional redundancy ( rho = -0.16). Personalized diagnosis is achieved by analyzing the contribution of each bacterium to the health index. We identified high-contribution species shared across multiple diseases by analyzing the contribution spectrum of these diseases. Conclusions Our research revealed that the proposed monitoring framework could promote a deep understanding of healthy microbiomes and unhealthy variations and served as a bridge toward individualized therapy target discovery and precise modulation.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available