4.7 Article

Learning representations for gene ontology terms by jointly encoding graph structure and textual node descriptors

期刊

BRIEFINGS IN BIOINFORMATICS
卷 23, 期 5, 页码 -

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbac318

关键词

ontology term; representation learning; semantic similarity; graph contrastive learning

资金

  1. National Natural Science Foundation of China (NSFC) [62171164, 62102191, 61872114]
  2. National Key Research & Development Plan of the Ministry of Science and Technology of China [2018YFC1314900, 2018YFC1314901]

向作者/读者索取更多资源

This paper proposes a novel representation model for GO terms, named GT2Vec, which considers both the GO graph structure obtained by graph contrastive learning and the semantic description of GO terms based on BERT encoders. Experimental results demonstrate the effectiveness of the model in learning vector representations for GO terms.
Measuring the semantic similarity between Gene Ontology (GO) terms is a fundamental step in numerous functional bioinformatics applications. To fully exploit the metadata of GO terms, word embedding-based methods have been proposed recently to map GO terms to low-dimensional feature vectors. However, these representation methods commonly overlook the key information hidden in the whole GO structure and the relationship between GO terms. In this paper, we propose a novel representation model for GO terms, named GT2Vec, which jointly considers the GO graph structure obtained by graph contrastive learning and the semantic description of GO terms based on BERT encoders. Our method is evaluated on a protein similarity task on a collection of benchmark datasets. The experimental results demonstrate the effectiveness of using a joint encoding graph structure and textual node descriptors to learn vector representations for GO terms.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据