☆ 4.5 Article

Hierarchical Deep Neural Network for Image Captioning

NEURAL PROCESSING LETTERS (2020)

期刊

NEURAL PROCESSING LETTERS

卷 52, 期 2, 页码 1057-1067

出版社

SPRINGER

DOI: 10.1007/s11063-019-09997-5

关键词

Regional semantic; Image captioning; Attention mechanism

类别

Computer Science, Artificial Intelligence

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Automatically describing image content with natural language is a fundamental challenge for computer vision community. General methods used visual information to generate sentences directly. However, only depending on the visual information is not enough to generate the fine-grained descriptions for given images. In this paper, we exploit the fusion of visual information and high-level semantic information for image captioning. We propose a hierarchical deep neural network, which consists of the bottom layer and the top layer. The former extracts the visual and high-level semantic information from image and detected regions, respectively, while the latter integrates both of them with adaptive attention mechanism for the caption generation. The experimental results achieve the competing performances against the state-of-the-art methods on MSCOCO dataset.

Hierarchical Deep Neural Network for Image Captioning

期刊

NEURAL PROCESSING LETTERS

出版社

SPRINGER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Hierarchical Deep Neural Network for Image Captioning

期刊

NEURAL PROCESSING LETTERS

出版社

SPRINGER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文