4.7 Article

Fusing visual and textual content for knowledge graph embedding via dual-track model

期刊

APPLIED SOFT COMPUTING
卷 128, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.asoc.2022.109524

关键词

Knowledge graph embedding; Multi -modal fusion; Link prediction; Knowledge graph completion

资金

  1. Beijing Natural Science Foundation of China [4182037]
  2. National Natural Science Foundation of China [U1636210, U1636211]
  3. Science and Technology Plan of Beijing Municipal Education Commission, PR China [KM202010017011]
  4. Open Re- search Fund from Shenzhen Research Institute of Big Data, PR China [2019ORF01012]

向作者/读者索取更多资源

In this work, a dual-track model DuMF is proposed for enhancing knowledge graph embedding. The model fuses multi-modal content and network structure information through two tracks, improving the expressiveness of joint features and learning task-specific important features. Experimental results demonstrate that the model outperforms baselines in link prediction and exhibits promising flexibility for further improvement.
Large-scale knowledge graphs are usually incomplete. Knowledge graph embedding has achieved encouraging performance in alleviating the incompleteness of knowledge graphs. There are approaches to leverage the multi-modal content, such as text description and images, to improve the performance of knowledge graph embedding. However, due to the heterogeneity across different modalities, current methods are not effective to fuse the multi-modal content and network structure information to learn the embedding. In this work, a dual-track model DuMF for knowledge graph embedding enhancement is proposed. The model includes two tracks to fuse multi-modal content and network structure respectively. In each track, the expressiveness of joint features is improved by the bilinear method, and meanwhile the task-specific important features are learned by deliberate attention. Finally, the fused features are generated by the gating network. To extensively evaluate the model, two challenging datasets are enriched with additional multi-modal data. Experimental results show that DuMF is superior to baselines in link prediction. The flexibility of the model is promising for the further improvement of model performance. (C) 2022 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据