4.7 Article

Combining embedding-based and symbol-based methods for entity alignment

期刊

PATTERN RECOGNITION
卷 124, 期 -, 页码 -

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.patcog.2021.108433

关键词

Entity alignment; Knowledge graph embedding; String Similarity

资金

  1. National Key Research and Development Program of China [2016YFB1000901]
  2. National Natural Science Foundation of China [6180 60 65, 91746209]
  3. funds for International Cooperation and Exchange of the National Natural Science Foundation of China [62120106008]
  4. Fundamental Research Funds for the Central Universities [JZ2020HGQA0186]

向作者/读者索取更多资源

The objective of entity alignment is to determine if entities refer to the same object in the real world. This paper introduces two groups of methods, symbol-based and embedding-based, for entity alignment. It proposes a promising strategy by combining the advantages of both methods and presents an improved algorithm, ESEA. Experimental results demonstrate that ESEA outperforms other embedding-based methods and the previous RTEA method.
The objective of entity alignment is to judge whether entities refer to the same object in the real world. Methods for entity alignment can be grossly divided into two groups: conventional symbol-based entity alignment methods and embedding-based entity alignment methods. Both groups of methods have advantages and disadvantages (which are detailed in Section 1). Therefore, combining the advantages of both methods might be a promising strategy. However, to the best of our knowledge, only the RTEA algorithm that was proposed in our previous conference paper (Proceeding of Pacific Rim International Conference on Artificial Intelligence, pp. 162-175, 2019) utilizes this strategy for entity alignment. This manuscript is an extended version of that conference paper, in which an improved algorithm, namely, ESEA (combining embedding-based and symbol-based methods for entity alignment), is proposed based on the following steps. First, a novel method for combining embedding models with symbol-based models is proposed. Entities with high vector similarities are obtained through a hybrid embedding model, and the final aligned entity pairs are calculated via symbol-based methods. Second, a series of symbol based methods, instead of only the edit distance method in the original version, are combined with embedding-based methods for relation alignment. Third, we combine symbol-based and embedding based methods in a more complicated framework with the objective of better exploiting the advantages of both methods. The experimental results on real-world datasets demonstrate that the proposed method outperformed several state-of-the-art embedding-based entity alignment approaches and outperformed our previous RTEA method.(c) 2021 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据