4.7 Article

AMAM: An Attention-based Multimodal Alignment Model for Medical Visual Question Answering

Related references

Note: Only part of the references are listed.
Article Multidisciplinary Sciences

MedFuseNet: An attention-based multimodal deep learning model for visual question answering in the medical domain

Dhruv Sharma et al.

Summary: Medical images are complex for non-experts, and healthcare professionals can face fatigue and errors due to high case loads. Therefore, developing a reliable VQA system specific to medical images is crucial for enhancing confidence among decision makers.

SCIENTIFIC REPORTS (2021)

Article Computer Science, Artificial Intelligence

Visual Question Answering via Combining Inferential Attention and Semantic Space Mapping

Yun Liu et al.

KNOWLEDGE-BASED SYSTEMS (2020)

Article Computer Science, Interdisciplinary Applications

A Question-Centric Model for Visual Question Answering in Medical Imaging

Minh H. Vu et al.

IEEE TRANSACTIONS ON MEDICAL IMAGING (2020)

Article Computer Science, Artificial Intelligence

Beyond Bilinear: Generalized Multimodal Factorized High-Order Pooling for Visual Question Answering

Zhou Yu et al.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2018)

Article Multidisciplinary Sciences

A dataset of clinically generated visual questions and answers about radiology images

Jason J. Lau et al.

SCIENTIFIC DATA (2018)

Article Computer Science, Artificial Intelligence

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Shaoqing Ren et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2017)

Proceedings Paper Computer Science, Artificial Intelligence

VQA: Visual Question Answering

Stanislaw Antol et al.

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2015)