☆ 4.7 Article

Multimodal Discriminative Binary Embedding for Large-Scale Cross-Modal Retrieval

IEEE TRANSACTIONS ON IMAGE PROCESSING (2016)

期刊

IEEE TRANSACTIONS ON IMAGE PROCESSING

卷 25, 期 10, 页码 4540-4554

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TIP.2016.2592800

关键词

Multimodal; binary code; hashing; cross-modal retrieval; discriminant analysis

类别

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic

资金

National Natural Science Foundation of China [61432014, 61472304, 61501349]
Key Industrial Innovation Chain in Industrial Domain [2016KTZDGY-02]
Program for Changjiang Scholars and Innovative Research Team in University of China [IRT13088]
Shaanxi Innovative Research Team for Key Science and Technology [2012KCT-02]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Multimodal hashing, which conducts effective and efficient nearest neighbor search across heterogeneous data on large-scale multimedia databases, has been attracting increasing interest, given the explosive growth of multimedia content on the Internet. Recent multimodal hashing research mainly aims at learning the compact binary codes to preserve semantic information given by labels. The overwhelming majority of these methods are similarity preserving approaches which approximate pairwise similarity matrix with Hamming distances between the to-be-learnt binary hash codes. However, these methods ignore the discriminative property in hash learning process, which results in hash codes from different classes undistinguished, and therefore reduces the accuracy and robustness for the nearest neighbor search. To this end, we present a novel multimodal hashing method, named multimodal discriminative binary embedding (MDBE), which focuses on learning discriminative hash codes. First, the proposed method formulates the hash function learning in terms of classification, where the binary codes generated by the learned hash functions are expected to be discriminative. And then, it exploits the label information to discover the shared structures inside heterogeneous data. Finally, the learned structures are preserved for hash codes to produce similar binary codes in the same class. Hence, the proposed MDBE can preserve both discriminability and similarity for hash codes, and will enhance retrieval accuracy. Thorough experiments on benchmark data sets demonstrate that the proposed method achieves excellent accuracy and competitive computational efficiency compared with the state-of-the-art methods for large-scale cross-modal retrieval task.

Multimodal Discriminative Binary Embedding for Large-Scale Cross-Modal Retrieval

期刊

IEEE TRANSACTIONS ON IMAGE PROCESSING

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Multimodal Discriminative Binary Embedding for Large-Scale Cross-Modal Retrieval

期刊

IEEE TRANSACTIONS ON IMAGE PROCESSING

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文