4.6 Article

Infrared Image Caption Based on Object-Oriented Attention

Related references

Note: Only part of the references are listed.
Proceedings Paper Geosciences, Multidisciplinary

CAPFORMER: PURE TRANSFORMER FOR REMOTE SENSING IMAGE CAPTION

Junjue Wang et al.

Summary: This paper proposes a pure Transformer (CapFormer) architecture for accurately describing high-spatial resolution remote sensing images. By adopting a scalable vision Transformer and a Transformer decoder, CapFormer outperforms the state-of-the-art image caption methods in summarizing complex scenes.

2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022) (2022)

Article Computer Science, Artificial Intelligence

Topic-Oriented Image Captioning Based on Order-Embedding

Niange Yu et al.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2019)

Proceedings Paper Computer Science, Hardware & Architecture

WHAT TOPICS DO IMAGES SAY: A NEURAL IMAGE CAPTIONING MODEL WITH TOPIC REPRESENTATION

Feng Chen et al.

2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW) (2019)

Article Computer Science, Artificial Intelligence

Show and Tell: Lessons Learned from the 2015 MSCOCO Image Captioning Challenge

Oriol Vinyals et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2017)

Article Computer Science, Artificial Intelligence

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Shaoqing Ren et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2017)

Proceedings Paper Computer Science, Artificial Intelligence

SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning

Long Chen et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)