相关参考文献
注意:仅列出部分参考文献,下载原文获取全部文献信息。Sequential Transformer via an Outside-In Attention for image captioning
Yiwei Wei et al.
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2022)
Dual Global Enhanced Transformer for image captioning
Tiantao Xian et al.
NEURAL NETWORKS (2022)
LaTr: Layout-Aware Transformer for Scene-Text VQA
Ali Furkan Biten et al.
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) (2022)
Multimodal Transformer With Multi-View Visual Representation for Image Captioning
Jun Yu et al.
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2020)
Show, Edit and Tell: A Framework for Editing Image Captions
Fawaz Sammani et al.
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2020)
Boosted Transformer for Image Captioning
Jiangyun Li et al.
APPLIED SCIENCES-BASEL (2019)
Captioning Transformer with Stacked Attention Modules
Xinxin Zhu et al.
APPLIED SCIENCES-BASEL (2018)
Deep Visual-Semantic Alignments for Generating Image Descriptions
Andrej Karpathy et al.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2017)
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren et al.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2017)