4.6 Article

Shall I Work with Them? A Knowledge Graph-Based Approach for Predicting Future Research Collaborations

期刊

ENTROPY
卷 23, 期 6, 页码 -

出版社

MDPI
DOI: 10.3390/e23060664

关键词

knowledge graph; link prediction; natural language processing; document representation; future research collaborations; graph kernels; word embeddings

资金

  1. European Union
  2. Greek national funds through the Operational Program Competitiveness, Entrepreneurship and Innovation, under the call RESEARCHCREATE-INNOVATE [T2EDK-04389]

向作者/读者索取更多资源

This study investigates the integration of unstructured textual data into a knowledge graph for predicting future research collaborations, proposing a three-phase pipeline for this purpose. The experimental results demonstrate a significant improvement in performance metrics for predicting future research collaborations on the new COVID-19 Open Research Dataset.
We consider the prediction of future research collaborations as a link prediction problem applied on a scientific knowledge graph. To the best of our knowledge, this is the first work on the prediction of future research collaborations that combines structural and textual information of a scientific knowledge graph through a purposeful integration of graph algorithms and natural language processing techniques. Our work: (i) investigates whether the integration of unstructured textual data into a single knowledge graph affects the performance of a link prediction model, (ii) studies the effect of previously proposed graph kernels based approaches on the performance of an ML model, as far as the link prediction problem is concerned, and (iii) proposes a three-phase pipeline that enables the exploitation of structural and textual information, as well as of pre-trained word embeddings. We benchmark the proposed approach against classical link prediction algorithms using accuracy, recall, and precision as our performance metrics. Finally, we empirically test our approach through various feature combinations with respect to the link prediction problem. Our experimentations with the new COVID-19 Open Research Dataset demonstrate a significant improvement of the abovementioned performance metrics in the prediction of future research collaborations.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据