4.6 Article

Multi-modal co-attention relation networks for visual question answering

Related references

Note: Only some of the references are listed.
Article Computer Science, Hardware & Architecture

A Traceable and Revocable Ciphertext-Policy Attribute-based Encryption Scheme Based on Privacy Protection

Dezhi Han et al.

Summary: The CP-ABE scheme proposed in this article supports revocation, white-box traceability, and hidden access policies. The ciphertext consists of two parts: the access policy, encrypted by attribute value, and the revocation information, which is tied to a binary tree. The scheme is proven to be IND-CPA secure in the standard model and is described as efficient and promising.

IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING (2022)

Article Automation & Control Systems

A Blockchain-Based Auditable Access Control System for Private Data in Service-Centric IoT Environments

Dezhi Han et al.

Summary: This article proposes an auditable access control model based on attribute-based access control for managing private data's access control policy in IoT environments. A blockchain-based auditable access control system is also introduced to ensure data security and auditable access in IoT environments. Experimental results demonstrate high throughput and data security for real application scenarios.

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS (2022)

Article Computer Science, Information Systems

A Privacy-Preserving Storage Scheme for Logistics Data With Assistance of Blockchain

Hongzhi Li et al.

Summary: In this study, a blockchain-assisted secure storage scheme is proposed for logistics data protection. The scheme utilizes a blockchain network to provide reliable storage interfaces and introduces an efficient consensus mechanism to improve the efficiency of the consensus process. Experimental results indicate that the performance of the scheme is acceptable.

IEEE INTERNET OF THINGS JOURNAL (2022)

Article Computer Science, Software Engineering

SPCA-Net: a based on spatial position relationship co-attention network for visual question answering

Feng Yan et al.

Summary: This paper proposes a deep co-attention network that addresses a gap in existing VQA models, which do not consider the spatial relationships between image region features. By introducing BERT and spatial position relationships, the model enables fine-grained interactions between question and image.

VISUAL COMPUTER (2022)

Article Computer Science, Software Engineering

Multiple answers to a question: a new approach for visual question answering

Sayedshayan Hashemi Hosseinabad et al.

Summary: This paper introduces a new approach to the multiple-answer VQA problem and provides a new dataset for model evaluation. The proposed model reduces operations by 94% compared to other models, making it well suited to real-time applications.

VISUAL COMPUTER (2021)

Article Computer Science, Artificial Intelligence

Cross-modality co-attention networks for visual question answering

Dezhi Han et al.

Summary: Visual question answering (VQA) is an emerging task that combines natural language processing and computer vision. The proposed cross-modality co-attention network (CMCN) framework aims to improve the learning of both intra-modal and cross-modal relationships. Its core module, cross-modality co-attention (CMC), is composed of self-attention and guided-attention blocks. Experimental results show that CMCN outperforms existing methods on the VQA 2.0 dataset.

SOFT COMPUTING (2021)

Article Chemistry, Analytical

An Effective Dense Co-Attention Networks for Visual Question Answering

Shirong He et al.

SENSORS (2020)

Article Computer Science, Information Systems

Reasoning on the Relation: Enhancing Visual Representation for Visual Question Answering and Cross-Modal Retrieval

Jing Yu et al.

IEEE TRANSACTIONS ON MULTIMEDIA (2020)

Article Engineering, Electrical & Electronic

ARFV: An Efficient Shared Data Auditing Scheme Supporting Revocation for Fog-Assisted Vehicular Ad-Hoc Networks

Mingming Cui et al.

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY (2020)

Article Computer Science, Information Systems

Multi-Modality Global Fusion Attention Network for Visual Question Answering

Cheng Yang et al.

ELECTRONICS (2020)

Article Computer Science, Information Systems

CGMVQA: A New Classification and Generative Model for Medical Visual Question Answering

Fuji Ren et al.

IEEE ACCESS (2020)

Article Computer Science, Artificial Intelligence

Learning Two-Branch Neural Networks for Image-Text Matching Tasks

Liwei Wang et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2019)

Article Computer Science, Information Systems

Deep Memory Network for Cross-Modal Retrieval

Ge Song et al.

IEEE TRANSACTIONS ON MULTIMEDIA (2019)

Article Computer Science, Artificial Intelligence

Visual question answering via Attention-based syntactic structure tree-LSTM

Yun Liu et al.

APPLIED SOFT COMPUTING (2019)

Article Computer Science, Information Systems

An Efficient and Safe Road Condition Monitoring Authentication Scheme Based on Fog Computing

Mingming Cui et al.

IEEE INTERNET OF THINGS JOURNAL (2019)

Proceedings Paper Computer Science, Interdisciplinary Applications

CRA-Net: Composed Relation Attention Network for Visual Question Answering

Liang Peng et al.

PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19) (2019)

Article Computer Science, Information Systems

Cross-Modal Multistep Fusion Network With Co-Attention for Visual Question Answering

Mingrui Lao et al.

IEEE ACCESS (2018)

Article Computer Science, Artificial Intelligence

Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations

Ranjay Krishna et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2017)

Proceedings Paper Computer Science, Artificial Intelligence

The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions

Peng Wang et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Spatial Memory for Context Reasoning in Object Detection

Xinlei Chen et al.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)

Article Computer Science, Information Systems

Cross-Modal Retrieval via Deep and Bidirectional Representation Learning

Yonghao He et al.

IEEE TRANSACTIONS ON MULTIMEDIA (2016)

Proceedings Paper Computer Science, Artificial Intelligence

Fast R-CNN

Ross Girshick

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2015)