4.5 Article

Knowledge graph embedding by fusing multimodal content via cross-modal learning

期刊

MATHEMATICAL BIOSCIENCES AND ENGINEERING
卷 20, 期 8, 页码 14180-14200

出版社

AMER INST MATHEMATICAL SCIENCES-AIMS
DOI: 10.3934/mbe.2023634

关键词

knowledge graph; embedding learning; graph embedding; multimodal learning; cross; modal correlation

向作者/读者索取更多资源

Knowledge graph embedding aims to learn representation vectors for entities and relations. Existing approaches mainly use structural information to learn the representation, neglecting content related to entities and relations. In this paper, a multi-modal content fusion model is proposed to effectively fuse heterogeneous data for knowledge graph embedding. Experimental results show that the proposed model outperforms state-of-the-art methods significantly, indicating its superiority.
Knowledge graph embedding aims to learn representation vectors for the entities and relations. Most of the existing approaches learn the representation from the structural information in the triples, which neglects the content related to the entity and relation. Though there are some approaches proposed to exploit the related multimodal content to improve knowledge graph embedding, such as the text description and images associated with the entities, they are not effective to address the heterogeneity and cross-modal correlation constraint of different types of content and network structure. In this paper, we propose a multi-modal content fusion model (MMCF) for knowledge graph embedding. To effectively fuse the heterogenous data for knowledge graph embedding, such as text description, related images and structural information, a cross-modal correlation learning component is proposed. It first learns the intra-modal and inter-modal correlation to fuse the multimodal content of each entity, and then they are fused with the structure features by a gating network. Meanwhile, to enhance the features of relation, the features of the associated head entity and tail entity are fused to learn relation embedding. To effectively evaluate the proposed model, we compare it with other baselines in three datasets, i.e., FB-IMG, WN18RR and FB15k-237. Experiment result of link prediction demonstrates that our model outperforms the state-of-the-art in most of the metrics significantly, implying the superiority of the proposed method.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据