☆ 4.7 Article

Variational autoencoder densified graph attention for fusing synonymous entities: Model and protocol

KNOWLEDGE-BASED SYSTEMS (2023)

期刊

KNOWLEDGE-BASED SYSTEMS

卷 259, 期 -, 页码 -

出版社

ELSEVIER

DOI: 10.1016/j.knosys.2022.110061

关键词

Open knowledge graph; Knowledge graph representation; Cluster ranking; Link prediction

类别

Computer Science, Artificial Intelligence

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper proposes the VGAT model and CR protocol to address the prediction of missing links in open knowledge graphs. The VGAT model automatically mines synonymous features using a variational autoencoder densified graph attention mechanism, while the CR protocol comprehensively evaluates multiple answers from the perspectives of significance and compactness.

The prediction of missing links of open knowledge graphs (OpenKGs) poses unique challenges compared with well-studied curated knowledge graphs (CuratedKGs). Unlike CuratedKGs whose entities are fully disambiguated against a fixed vocabulary, OpenKGs consist of entities represented by non-canonicalized free-form noun phrases and do not require an ontology specification, which drives the synonymity (multiple entities with different surface forms have the same meaning) and sparsity (a large portion of entities with few links). How to capture synonymous features in such sparse situations and how to evaluate the multiple answers pose challenges to existing models and evaluation protocols. In this paper, we propose VGAT, a variational autoencoder densified graph attention model to automatically mine synonymity features, and propose CR, a cluster ranking protocol to evaluate multiple answers in OpenKGs. For the model, VGAT investigates the following key ideas: (1) phrasal synonymity encoder attempts to capture phrasal features, which can automatically make entities with synonymous texts have closer representations; (2) neighbor synonymity encoder mines structural features with a graph attention network, which can recursively make entities with synonymous neighbors closer in representations. (3) densification attempts to densify the OpenKGs by generating similar embeddings and negative samples. For the protocol, CR is designed from the significance and compactness perspectives to comprehensively evaluate multiple answers. Extensive experiments and analysis show the effectiveness of the VGAT model and rationality of the CR protocol. (c) 2022 Elsevier B.V. All rights reserved.

Variational autoencoder densified graph attention for fusing synonymous entities: Model and protocol

期刊

KNOWLEDGE-BASED SYSTEMS

出版社

ELSEVIER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Variational autoencoder densified graph attention for fusing synonymous entities: Model and protocol

期刊

KNOWLEDGE-BASED SYSTEMS

出版社

ELSEVIER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文