4.7 Article

phyLoSTM: a novel deep learning model on disease prediction from longitudinal microbiome data

期刊

BIOINFORMATICS
卷 37, 期 21, 页码 3707-3714

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btab482

关键词

-

资金

  1. Natural Sciences and Engineering Research Council of Canada (NSERC) [RGPIN-2017-06672]
  2. Crohn's and Colitis Canada (CCC Grant CCC-GEMIII)
  3. Helmsley Charitable Trust
  4. NSERC [RGPIN-2017-06672]

向作者/读者索取更多资源

This study introduces a novel deep learning framework 'phyLoSTM' for analyzing temporal dependency in longitudinal microbiome sequencing data and predicting diseases in relation to host's environmental factors. Results show promising performance in simulated and real microbiome studies.
Motivation: Research shows that human microbiome is highly dynamic on longitudinal timescales, changing dynamically with diet, or due to medical interventions. In this article, we propose a novel deep learning framework 'phyLoSTM', using a combination of Convolutional Neural Networks and Long Short Term Memory Networks (LSTM) for feature extraction and analysis of temporal dependency in longitudinal microbiome sequencing data along with host's environmental factors for disease prediction. Additional novelty in terms of handling variable timepoints in subjects through LSTMs, as well as, weight balancing between imbalanced cases and controls is proposed. Results: We simulated 100 datasets across multiple time points for model testing. To demonstrate the model's effectiveness, we also implemented this novel method into two real longitudinal human microbiome studies: (i) DIABIMMUNE three country cohort with food allergy outcomes (Milk, Egg, Peanut and Overall) and (ii) DiGiulio study with preterm delivery as outcome. Extensive analysis and comparison of our approach yields encouraging performance with an AUC of 0.897 (increased by 5%) on simulated studies and AUCs of 0.762 (increased by 19%) and 0.713 (increased by 8%) on the two real longitudinal microbiome studies respectively, as compared to the next best performing method, Random Forest. The proposed methodology improves predictive accuracy on longitudinal human microbiome studies containing spatially correlated data, and evaluates the change of microbiome composition contributing to outcome prediction. Supplementary information: Supplementary data are available at Bioinformatics online.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据