☆ 4.5 Article

Knowledge graph embedding by fusing multimodal content via cross-modal learning

MATHEMATICAL BIOSCIENCES AND ENGINEERING (2023)

期刊

MATHEMATICAL BIOSCIENCES AND ENGINEERING

卷 20, 期 8, 页码 14180-14200

出版社

AMER INST MATHEMATICAL SCIENCES-AIMS

DOI: 10.3934/mbe.2023634

关键词

knowledge graph; embedding learning; graph embedding; multimodal learning; cross; modal correlation

类别

Mathematical & Computational Biology

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Knowledge graph embedding aims to learn representation vectors for entities and relations. Existing approaches mainly use structural information to learn the representation, neglecting content related to entities and relations. In this paper, a multi-modal content fusion model is proposed to effectively fuse heterogeneous data for knowledge graph embedding. Experimental results show that the proposed model outperforms state-of-the-art methods significantly, indicating its superiority.

Knowledge graph embedding aims to learn representation vectors for the entities and relations. Most of the existing approaches learn the representation from the structural information in the triples, which neglects the content related to the entity and relation. Though there are some approaches proposed to exploit the related multimodal content to improve knowledge graph embedding, such as the text description and images associated with the entities, they are not effective to address the heterogeneity and cross-modal correlation constraint of different types of content and network structure. In this paper, we propose a multi-modal content fusion model (MMCF) for knowledge graph embedding. To effectively fuse the heterogenous data for knowledge graph embedding, such as text description, related images and structural information, a cross-modal correlation learning component is proposed. It first learns the intra-modal and inter-modal correlation to fuse the multimodal content of each entity, and then they are fused with the structure features by a gating network. Meanwhile, to enhance the features of relation, the features of the associated head entity and tail entity are fused to learn relation embedding. To effectively evaluate the proposed model, we compare it with other baselines in three datasets, i.e., FB-IMG, WN18RR and FB15k-237. Experiment result of link prediction demonstrates that our model outperforms the state-of-the-art in most of the metrics significantly, implying the superiority of the proposed method.

Knowledge graph embedding by fusing multimodal content via cross-modal learning

期刊

MATHEMATICAL BIOSCIENCES AND ENGINEERING

出版社

AMER INST MATHEMATICAL SCIENCES-AIMS

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Knowledge graph embedding by fusing multimodal content via cross-modal learning

期刊

MATHEMATICAL BIOSCIENCES AND ENGINEERING

出版社

AMER INST MATHEMATICAL SCIENCES-AIMS

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文