4.7 Article

MorbidGCN: prediction of multimorbidity with a graph convolutional network based on integration of population phenotypes and disease network

Journal

BRIEFINGS IN BIOINFORMATICS
Volume 23, Issue 4, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbac255

Keywords

multimorbidity; graph convolutional network; population phenotype; disease network

Funding

  1. National Key Research and Development Program of China [2020YFA0712403]
  2. National Natural Science Foundation of China [61932008]
  3. Shanghai Municipal Science Technology Major Project [2018SHZDZX01]
  4. 111 Project [B18015]

Ask authors/readers for more resources

Exploring multimorbidity relationships among diseases is crucial for precise diagnosis and treatment. MorbidGCN, a graph convolutional network model, integrates population phenotypes and disease networks for multimorbidity prediction and outperforms other methods in identifying new multimorbidities.
Exploring multimorbidity relationships among diseases is of great importance for understanding their shared mechanisms, precise diagnosis and treatment. However, the landscape of multimorbidities is still far from complete due to the complex nature of multimorbidity. Although various types of biological data, such as biomolecules and clinical symptoms, have been used to identify multimorbidities, the population phenotype information (e.g. physical activity and diet) remains less explored for multimorbidity. Here, we present a graph convolutional network (GCN) model, named MorbidGCN, for multimorbidity prediction by integrating population phenotypes and disease network. Specifically, MorbidGCN treats the multimorbidity prediction as a missing link prediction problem in the disease network, where a novel feature selection method is embedded to select important phenotypes. Benchmarking results on two large-scale multimorbidity data sets, i.e. the UK Biobank (UKB) and Human Disease Network (HuDiNe) data sets, demonstrate that MorbidGCN outperforms other competitive methods. With MorbidGCN, 9742 and 14 010 novel multimorbidities are identified in the UKB and HuDiNe data sets, respectively. Moreover, we notice that the selected phenotypes that are generally differentially distributed between multimorbidity patients and single-disease patients can help interpret multimorbidities and show potential for prognosis of multimorbidities.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available