4.6 Article

Joint embedding VQA model based on dynamic word vector

期刊

PEERJ COMPUTER SCIENCE
卷 -, 期 -, 页码 -

出版社

PEERJ INC
DOI: 10.7717/peerj-cs.353

关键词

Faster R-CNN; ELMo; MA; VQA

资金

  1. Sichuan Science and Technology Program [2019YJ0189]
  2. Fundamental Research Funds for the Central Universities [ZYGX2019J059]

向作者/读者索取更多资源

This article introduces a joint embedding model based on dynamic word vector-N-KBSN model, which can better express semantic and grammatical differences compared to existing Visual Question Answering models based on static word vectors. Experimental results show that the accuracy of this model significantly improves.
The existing joint embedding Visual Question Answering models use different combinations of image characterization, text characterization and feature fusion method, but all the existing models use static word vectors for text characterization. However, in the real language environment, the same word may represent different meanings in different contexts, and may also be used as different grammatical components. These differences cannot be effectively expressed by static word vectors, so there may be semantic and grammatical deviations. In order to solve this problem, our article constructs a joint embedding model based on dynamic word vector-none KB-Specific network (N-KBSN) model which is different from commonly used Visual Question Answering models based on static word vectors. The N-KBSN model consists of three main parts: question text and image feature extraction module, self attention and guided attention module, feature fusion and classifier module. Among them, the key parts of N-KBSN model are: image characterization based on Faster R-CNN, text characterization based on ELMo and feature enhancement based on multi-head attention mechanism. The experimental results show that the N-KBSN constructed in our experiment is better than the other 2017-winner (glove) model and 2019-winner (glove) model. The introduction of dynamic word vector improves the accuracy of the overall results.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据