Related references
Note: Only part of the references are listed.Robust Sparse Weighted Classification for Crowdsourcing
Hao Yu et al.
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2023)
Deep Multigraph Hierarchical Enhanced Semantic Representation for Cross-Modal Retrieval
Lei Zhu et al.
IEEE MULTIMEDIA (2022)
PPIS-JOIN: A Novel Privacy-Preserving Image Similarity Join Method
Chengyuan Zhang et al.
NEURAL PROCESSING LETTERS (2022)
Deep Multi-Modal Object Detection and Semantic Segmentation for Autonomous Driving: Datasets, Methods, and Challenges
Di Feng et al.
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS (2021)
HCMSL: Hybrid Cross-modal Similarity Learning for Cross-modal Retrieval
Chengyuan Zhang et al.
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS (2021)
M2GUDA: Multi-Metrics Graph-Based Unsupervised Domain Adaptation for Cross-Modal Hashing
Chengyuan Zhang et al.
PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21) (2021)
Re-Attention for Visual Question Answering
Wenya Guo et al.
IEEE TRANSACTIONS ON IMAGE PROCESSING (2021)
Stimulus-driven and concept-driven analysis for image caption generation
Songtao Ding et al.
NEUROCOMPUTING (2020)
Multimodal feature fusion by relational reasoning and attention for visual question answering
Weifeng Zhang et al.
INFORMATION FUSION (2020)
Unpaired Multi-Modal Segmentation via Knowledge Distillation
Qi Dou et al.
IEEE TRANSACTIONS ON MEDICAL IMAGING (2020)
Compositional Attention Networks With Two-Stream Fusion for Video Question Answering
Ting Yu et al.
IEEE TRANSACTIONS ON IMAGE PROCESSING (2020)
Multi-Task Consistency-Preserving Adversarial Hashing for Cross-Modal Retrieval
De Xie et al.
IEEE TRANSACTIONS ON IMAGE PROCESSING (2020)
DRAU: Dual Recurrent Attention Units for Visual Question Answering
Ahmed Osman et al.
COMPUTER VISION AND IMAGE UNDERSTANDING (2019)
A long video caption generation algorithm for big video data retrieval
Songtao Ding et al.
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE (2019)
Predicting Visual Features From Text for Image and Video Caption Retrieval
Jianfeng Dong et al.
IEEE TRANSACTIONS ON MULTIMEDIA (2018)
Joint feature selection and graph regularization for modality-dependent cross-modal retrieval
Li Wang et al.
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION (2018)
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren et al.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2017)