4.6 Article

Stacked cross-modal feature consolidation attention networks for image captioning

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Computer Science, Information Systems

Textual Context-Aware Dense Captioning with Diverse Words

Zhuang Shao et al.

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

Review Computer Science, Artificial Intelligence

Neural attention for image captioning: review of outstanding methods

Zanyar Zohourianshahzadi et al.

Summary: This paper reviews literature on attentive deep learning models for image captioning, emphasizing different types of attention mechanisms. The most successful image captioning models follow the encoder-decoder architecture, with the best results currently achieved from variants of multi-head attention with bottom-up attention.

ARTIFICIAL INTELLIGENCE REVIEW (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Scaling Up Vision-Language Pre-training for Image Captioning

Xiaowei Hu et al.

Summary: This paper presents LEMON, a large-scale image captioning model, and conducts an empirical study on the scaling behavior of vision-language pre-training models. The experiments show that LEMON achieves new state-of-the-art results on several image captioning benchmarks and is capable of generating captions with long-tail visual concepts in a zero-shot manner.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Comprehending and Ordering Semantics for Image Captioning

Yehao Li et al.

Summary: This paper proposes a new Transformer-style structure called COS-Net, which integrates semantic comprehension and ordering processes into a single architecture. By utilizing cross-modal retrieval and a semantic ranker, COS-Net achieves superior performance in image captioning tasks.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) (2022)

Article Computer Science, Artificial Intelligence

Hierarchical Deep Neural Network for Image Captioning

Yuting Su et al.

NEURAL PROCESSING LETTERS (2020)

Article Computer Science, Artificial Intelligence

Learning visual relationship and context-aware attention for image captioning

Junbo Wang et al.

PATTERN RECOGNITION (2020)

Article Computer Science, Artificial Intelligence

Image Captioning With End-to-End Attribute Detection and Subsequent Attributes Prediction

Yiqing Huang et al.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2020)

Article Computer Science, Artificial Intelligence

Image Captioning with Text-Based Visual Attention

Chen He et al.

NEURAL PROCESSING LETTERS (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Look Back and Predict Forward in Image Captioning

Yu Qin et al.

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) (2019)

Article Computer Science, Artificial Intelligence

Aligning Where to See and What to Tell: Image Captioning with Region-Based Attention and Scene-Specific Contexts

Kun Fu et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2017)

Article Computer Science, Artificial Intelligence

LSTM: A Search Space Odyssey

Klaus Greff et al.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Semantic Compositional Networks for Visual Captioning

Zhe Gan et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning

Long Chen et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

DenseCap: Fully Convolutional Localization Networks for Dense Captioning

Justin Johnson et al.

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2016)

Article Computer Science, Artificial Intelligence

Selective Search for Object Recognition

J. R. R. Uijlings et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2013)

Article Multidisciplinary Sciences

Top-down versus bottom-up control of attention in the prefrontal and posterior parietal cortices

Timothy J. Buschman et al.

SCIENCE (2007)

Review Neurosciences

Control of goal-directed and stimulus-driven attention in the brain

M Corbetta et al.

NATURE REVIEWS NEUROSCIENCE (2002)