4.6 Article

Vision-to-Language Tasks Based on Attributes and Attention Mechanism

Related references

Note: Only part of the references are listed.
Article Computer Science, Artificial Intelligence

Learning Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection

Gong Cheng et al.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2019)

Article Engineering, Electrical & Electronic

A Unified Metric Learning-Based Framework for Co-Saliency Detection

Junwei Han et al.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2018)

Article Automation & Control Systems

CNNs-Based RGB-D Saliency Detection via Cross-View Transfer and Multiview Fusion

Junwei Han et al.

IEEE TRANSACTIONS ON CYBERNETICS (2018)

Article Geochemistry & Geophysics

Exploring Models and Data for Remote Sensing Image Caption Generation

Xiaoqiang Lu et al.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2018)

Article Computer Science, Artificial Intelligence

Image Captioning and Visual Question Answering Based on Attributes and External Knowledge

Qi Wu et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2018)

Article Computer Science, Artificial Intelligence

Aligning Where to See and What to Tell: Image Captioning with Region-Based Attention and Scene-Specific Contexts

Kun Fu et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Task-Driven Dynamic Fusion: Reducing Ambiguity in Video Description

Xishan Zhang et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Attend to You: Personalized Image Captioning with Context Sequence Memory Networks

Cesc Chunseong Park et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering

Yunseok Jang et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Hierarchical Boundary-Aware Neural Encoder for Video Captioning

Lorenzo Baraldi et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Article Computer Science, Artificial Intelligence

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2015)

Proceedings Paper Computer Science, Artificial Intelligence

VQA: Visual Question Answering

Stanislaw Antol et al.

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2015)

Article Computer Science, Hardware & Architecture

Joint Video and Text Parsing for Understanding Events and Answering Queries

Kewei Tu et al.

IEEE MULTIMEDIA (2014)

Article Computer Science, Artificial Intelligence

BabyTalk: Understanding and Generating Simple Image Descriptions

Girish Kulkarni et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2013)

Article Computer Science, Artificial Intelligence

Framing Image Description as a Ranking Task: Data, Models and Evaluation Metrics

Micah Hodosh et al.

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH (2013)