4.7 Article

MB-GAN: Microbiome Simulation via Generative Adversarial Network

期刊

GIGASCIENCE
卷 10, 期 2, 页码 -

出版社

OXFORD UNIV PRESS
DOI: 10.1093/gigascience/giab005

关键词

microbiome simulation; generative adversarial network; deep learning

资金

  1. National Institutes of Health [5P30CA142543, 5R01GM126479, 5R01HG008983, 1R56HG011035]

向作者/读者索取更多资源

MB-GAN is a novel simulation framework designed using generative adversarial networks to generate microbiome data that are indistinguishable from real data. Compared to traditional methods, MB-GAN does not require explicit statistical modeling assumptions and is easily applicable.
Background: Trillions of microbes inhabit the human body and have a profound effect on human health. The recent development of metagenome-wide association studies and other quantitative analysis methods accelerate the discovery of the associations between human microbiome and diseases. To assess the strengths and limitations of these analytical tools, simulating realistic microbiome datasets is critically important. However, simulating the real microbiome data is challenging because it is difficult to model their correlation structure using explicit statistical models. Results: To address the challenge of simulating realistic microbiome data, we designed a novel simulation framework termed MB-GAN, by using a generative adversarial network (GAN) and utilizing methodology advancements from the deep learning community. MB-GAN can automatically learn from given microbial abundances and compute simulated abundances that are indistinguishable from them. In practice, MB-GAN showed the following advantages. First, MB-GAN avoids explicit statistical modeling assumptions, and it only requires real datasets as inputs. Second, unlike the traditional GANs, MB-GAN is easily applicable and can converge efficiently. Conclusions: By applying MB-GAN to a case-control gut microbiome study of 396 samples, we demonstrated that the simulated data and the original data had similar first-order and second-order properties, including sparsity, diversities, and taxa-taxa correlations. These advantages are suitable for further microbiome methodology development where high-fidelity microbiome data are needed.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据