4.7 Article

CoGO: a contrastive learning framework to predict disease similarity based on gene network and ontology structure

Journal

BIOINFORMATICS
Volume 38, Issue 18, Pages 4380-4386

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btac520

Keywords

-

Funding

  1. National Natural Sciences Foundation of China [32070677]
  2. 151 Talent Project of Zhejiang Province
  3. Jiangsu Collaborative Innovation Center for Modern Crop Production and Collaborative Innovation Center for Modern Crop Production

Ask authors/readers for more resources

This paper presents a contrastive learning framework CoGO, which uses deep learning models to extract and incorporate biological data for predicting disease similarity. Experimental results show that CoGO outperforms other methods in terms of prediction performance and provides highly credible disease similarity results compared to other studies.
Motivation: Quantifying the similarity of human diseases provides guiding insights to the discovery of micro-scope mechanisms from a macro scale. Previous work demonstrated that better performance can be gained by integrating multiview data sources or applying machine learning techniques. However, designing an efficient framework to extract and incorporate information from different biological data using deep learning models remains unexplored. Results: We present CoGO, a Contrastive learning framework to predict disease similarity based on Gene network and Ontology structure, which incorporates the gene interaction network and gene ontology (GO) domain knowledge using graph deep learning models. First, graph deep learning models are applied to encode the features of genes and GO terms from separate graph structure data. Next, gene and GO features are projected to a common embedding space via a nonlinear projection. Then cross-view contrastive loss is applied to maximize the agreement of corresponding gene-GO associations and lead to meaningful gene representation. Finally, CoGO infers the similarity between diseases by the cosine similarity of disease representation vectors derived from related gene embedding. In our experiments, CoGO outperforms the most competitive baseline method on both AUROC and AUPRC, especially improves 19.57% in AUPRC (0.7733). The prediction results are significantly comparable with other disease similarity studies and thus highly credible. Furthermore, we conduct a detailed case study of top similar disease pairs which is demonstrated by other studies. Empirical results show that CoGO achieves powerful performance in disease similarity problem.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available