Related references
Note: Only some of the references are listed.
Deep Learning-based Text Classification: A Comprehensive Review
Shervin Minaee et al.
ACM COMPUTING SURVEYS (2022)
Rich Visual Knowledge-Based Augmentation Network for Visual Question Answering
Liyang Zhang et al.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2021)
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition
Cunhang Fan et al.
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2021)
Text classification using capsules
Jaeyoung Kim et al.
NEUROCOMPUTING (2020)
Reasoning on the Relation: Enhancing Visual Representation for Visual Question Answering and Cross-Modal Retrieval
Jing Yu et al.
IEEE TRANSACTIONS ON MULTIMEDIA (2020)
Multi-Modal Explicit Sparse Attention Networks for Visual Question Answering
Zihan Guo et al.
SENSORS (2020)
Transformer-Based Online CTC/Attention End-to-End Speech Recognition Architecture
Haoran Miao et al.
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (2020)
An Ensemble of Generation- and Retrieval-Based Image Captioning With Dual Generator Generative Adversarial Network
Min Yang et al.
IEEE TRANSACTIONS ON IMAGE PROCESSING (2020)
Multimodal Encoder-Decoder Attention Networks for Visual Question Answering
Chongqing Chen et al.
IEEE ACCESS (2020)
Unsupervised Neural Machine Translation With Cross-Lingual Language Representation Agreement
Haipeng Sun et al.
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2020)
Towards More Diverse Input Representation for Neural Machine Translation
Kehai Chen et al.
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2020)
Learning Two-Branch Neural Networks for Image-Text Matching Tasks
Liwei Wang et al.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2019)
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
Yash Goyal et al.
INTERNATIONAL JOURNAL OF COMPUTER VISION (2019)
Topic-Oriented Image Captioning Based on Order-Embedding
Niange Yu et al.
IEEE TRANSACTIONS ON IMAGE PROCESSING (2019)
Co-Attention Network With Question Type for Visual Question Answering
Chao Yang et al.
IEEE ACCESS (2019)
VQA: Visual Question Answering
Aishwarya Agrawal et al.
INTERNATIONAL JOURNAL OF COMPUTER VISION (2017)
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren et al.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2017)
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
Ranjay Krishna et al.
INTERNATIONAL JOURNAL OF COMPUTER VISION (2017)