4.7 Article

Triplet-Based Deep Hashing Network for Cross-Modal Retrieval

期刊

IEEE TRANSACTIONS ON IMAGE PROCESSING
卷 27, 期 8, 页码 3893-3903

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TIP.2018.2821921

关键词

Deep neural network; hashing; triplet labels; cross-modal retrieval; graph regularization

资金

  1. National Natural Science Foundation of China [61572388, 61703327]
  2. Key Research and Development Program-The Key Industry Innovation Chain of Shaanxi [2017ZDCXL-GY-05-04-02]
  3. Australian Research Council [FL-170100117, DP-180103424, LP-150100671]

向作者/读者索取更多资源

Given the benefits of its low storage requirements and high retrieval efficiency, hashing has recently received increasing attention. In particular, cross-modal hashing has been widely and successfully used in multimedia similarity search applications. However, almost all existing methods employing cross-modal hashing cannot obtain powerful hash codes due to their ignoring the relative similarity between heterogeneous data that contains richer semantic information, leading to unsatisfactory retrieval performance. In this paper, we propose a triplet-based deep hashing (TDH) network for cross-modal retrieval. First, we utilize the triplet labels, which describe the relative relationships among three instances as supervision in order to capture more general semantic correlations between cross-modal instances. We then establish a loss function from the inter-modal view and the intra-modal view to boost the discriminative abilities of the hash codes. Finally, graph regularization is introduced into our proposed TDH method to preserve the original semantic similarity between hash codes in Hamming space. Experimental results show that our proposed method outperforms several state-of-the-art approaches on two popular cross-modal data sets.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据