4.8 Article

Integrating Multi-Label Contrastive Learning With Dual Adversarial Graph Neural Networks for Cross-Modal Retrieval

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Computer Science, Artificial Intelligence

A Comprehensive Survey on Graph Neural Networks

Zonghan Wu et al.

Summary: This article provides a comprehensive overview of graph neural networks (GNNs) in data mining and machine learning fields. It discusses the taxonomy of GNNs, their applications, and summarizes open-source codes, benchmark data sets, and model evaluation. The article also proposes potential research directions in this rapidly growing field.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2021)

Article Computer Science, Artificial Intelligence

MTFH: A Matrix Tri-Factorization Hashing Framework for Efficient Cross-Modal Retrieval

Xin Liu et al.

Summary: This paper proposes a Matrix Tri-Factorization Hashing (MTFH) framework, which is generalized and flexible in cross-modal retrieval. It can adapt to various scenarios of multi-modal data and learn hash codes of different lengths to better handle challenging tasks.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2021)

Article Computer Science, Artificial Intelligence

AutoML for Multi-Label Classification: Overview and Empirical Evaluation

Marcel Wever et al.

Summary: Automated machine learning (AutoML) enables the automatic construction and customization of machine learning algorithms for supervised learning tasks, showing impressive results in the realm of single-label classification (SLC). When extending AutoML approaches to multi-label classification (MLC), the complexity of the search space increases and requires more efficient optimizers to handle the higher complexity.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Probabilistic Embeddings for Cross-Modal Retrieval

Sanghyuk Chun et al.

Summary: Cross-modal retrieval methods aim to build a common representation space for samples from different modalities, such as vision and language. This paper introduces Probabilistic Cross-Modal Embedding (PCME) to represent samples as probabilistic distributions, showing improved retrieval performance and providing uncertainty estimates for better interpretability. By evaluating on the CUB dataset with exhaustive annotations, PCME outperforms deterministic methods in capturing one-to-many correspondences.

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Learning Cross-Modal Retrieval with Noisy Labels

Peng Hu et al.

Summary: This paper proposes a general Multimodal Robust Learning framework (MRL) to learn with multimodal noisy labels and mitigate noisy samples while correlating different modalities. The Robust Clustering loss (RC) is introduced to focus on clean samples instead of noisy ones, and the Multimodal Contrastive loss (MC) aims to maximize mutual information between different modalities to alleviate the interference of noisy samples and cross-modal discrepancy. Extensive experiments on four widely-used multimodal datasets show the effectiveness of the proposed approach compared to 14 state-of-the-art methods.

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 (2021)

Article Computer Science, Information Systems

Instance-Level Heterogeneous Domain Adaptation for Limited-Labeled Sketch-to-Photo Retrieval

Fan Yang et al.

Summary: This paper proposes an Instance-level Heterogeneous Domain Adaptation (IHDA) framework to address domain adaptation issues in sketch-to-photo retrieval. By utilizing fine-tuning for identity label learning, shared attributes, and domain adaptation, instance-level knowledge transfer is facilitated.

IEEE TRANSACTIONS ON MULTIMEDIA (2021)

Article Computer Science, Information Systems

Adaptive Label-Aware Graph Convolutional Networks for Cross-Modal Retrieval

Shengsheng Qian et al.

Summary: In this paper, a novel end-to-end adaptive label-aware graph convolutional network (ALGCN) is proposed for cross-modal retrieval, achieving modality-invariant and discriminative representations by designing instance and label representation learning branches. The ALGCN model outperforms state-of-the-art methods in cross-modal retrieval on benchmark datasets like NUS-WIDE, MIRFlickr and MS-COCO.

IEEE TRANSACTIONS ON MULTIMEDIA (2021)

Article Computer Science, Information Systems

Learning Feature Representation and Partial Correlation for Multimodal Multi-Label Data

Guoli Song et al.

Summary: The proposed FLPCL method utilizes deep feature learning and partial correlation learning to infer relationships between modalities and learn effective multimodal representations. It outperforms state-of-the-art methods on cross-modal retrieval tasks.

IEEE TRANSACTIONS ON MULTIMEDIA (2021)

Article Computer Science, Information Systems

Disentangling, Embedding and Ranking Label Cues for Multi-Label Image Recognition

Zhao-Min Chen et al.

Summary: The paper presents a unified deep learning framework DER for multi-label image recognition, aiming to explicitly model the correlations between labels. Through operations such as embedding, ranking, and disentangling, our model consistently outperforms previous competitive methods on three different datasets.

IEEE TRANSACTIONS ON MULTIMEDIA (2021)

Article Computer Science, Information Systems

Deep Collaborative Discrete Hashing With Semantic-Invariant Structure Construction

Zijian Wang et al.

Summary: The paper presents a dual-stream learning framework called Deep Collaborative Discrete Hashing (DCDH) which collaboratively constructs a discriminative common discrete space from visual and semantic features, achieving state-of-the-art image retrieval performance in large-scale multimedia retrieval tasks.

IEEE TRANSACTIONS ON MULTIMEDIA (2021)

Article Automation & Control Systems

Learning Graph Embedding With Adversarial Training Methods

Shirui Pan et al.

IEEE TRANSACTIONS ON CYBERNETICS (2020)

Proceedings Paper Computer Science, Information Systems

Joint-modal Distribution-based Similarity Hashing for Large-scale Unsupervised Deep Cross-modal Retrieval

Song Liu et al.

PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20) (2020)

Article Computer Science, Artificial Intelligence

Collective Affinity Learning for Partial Cross-Modal Hashing

Jun Guo et al.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2020)

Article Computer Science, Artificial Intelligence

Fast Multi-Instance Multi-Label Learning

Sheng-Jun Huang et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2019)

Article Computer Science, Artificial Intelligence

Multi-modal semantic autoencoder for cross-modal retrieval

Yiling Wu et al.

NEUROCOMPUTING (2019)

Article Computer Science, Information Systems

CM-GANs: Cross-modal Generative Adversarial Networks for Common Representation Learning

Yuxin Peng et al.

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Deep Joint-Semantics Reconstructing Hashing for Large-Scale Unsupervised Cross-Modal Retrieval

Shupeng Su et al.

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Multi-Label Image Recognition with Graph Convolutional Networks

Zhao-Min Chen et al.

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) (2019)

Article Computer Science, Artificial Intelligence

Linear Subspace Ranking Hashing for Cross-Modal Retrieval

Kai Li et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2017)

Article Computer Science, Information Systems

Multi-Modal Event Topic Model for Social Event Analysis

Shengsheng Qian et al.

IEEE TRANSACTIONS ON MULTIMEDIA (2016)

Article Computer Science, Artificial Intelligence

Co-Labeling for Multi-View Weakly Labeled Learning

Xinxing Xu et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2016)

Article Computer Science, Artificial Intelligence

Joint Feature Selection and Subspace Learning for Cross-Modal Retrieval

Kaiye Wang et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2016)

Proceedings Paper Computer Science, Artificial Intelligence

Learning Query and Image Similarities with Ranking Canonical Correlation Analysis

Ting Yao et al.

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2015)

Article Computer Science, Theory & Methods

Multimedia Search Reranking: A Literature Survey

Tao Mei et al.

ACM COMPUTING SURVEYS (2014)

Article Engineering, Electrical & Electronic

Learning Cross-Media Joint Representation With Sparse and Semisupervised Regularization

Xiaohua Zhai et al.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2014)

Article Computer Science, Artificial Intelligence

The Graph Neural Network Model

Franco Scarselli et al.

IEEE TRANSACTIONS ON NEURAL NETWORKS (2009)