4.8 Article

Deep Multi-View Enhancement Hashing for Image Retrieval

Journal

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TPAMI.2020.2975798

Keywords

Image retrieval; Binary codes; Machine learning; Training; Feature extraction; Neural networks; Stability analysis; Multi-view hashing; multi-view enhancement; image retrieval

Funding

  1. National Nature Science Foundation of China [61931008, 61671267, 61671196, 61701149, 61801157, 61971268, 61901145, 61901150, 61972123]
  2. National Natural Science Major Foundation of Research Instrumentation of PR China [61427808]
  3. Zhejiang Province Nature Science Foundation of China [LR17F030006, Q19F010030]
  4. 111 Project [D17019]

Ask authors/readers for more resources

This paper proposes a novel multi-view hashing learning method, integrating neural networks to enhance retrieval performance significantly. By effectively evaluating view stability and fusing multiple data, relationships between views are explored and advantages are preserved.
Hashing is an efficient method for nearest neighbor search in large-scale data space by embedding high-dimensional feature descriptors into a similarity preserving Hamming space with a low dimension. However, large-scale high-speed retrieval through binary code has a certain degree of reduction in retrieval accuracy compared to traditional retrieval methods. We have noticed that multi-view methods can well preserve the diverse characteristics of data. Therefore, we try to introduce the multi-view deep neural network into the hash learning field, and design an efficient and innovative retrieval model, which has achieved a significant improvement in retrieval performance. In this paper, we propose a supervised multi-view hash model which can enhance the multi-view information through neural networks. This is a completely new hash learning method that combines multi-view and deep learning methods. The proposed method utilizes an effective view stability evaluation method to actively explore the relationship among views, which will affect the optimization direction of the entire network. We have also designed a variety of multi-data fusion methods in the Hamming space to preserve the advantages of both convolution and multi-view. In order to avoid excessive computing resources on the enhancement procedure during retrieval, we set up a separate structure called memory network which participates in training together. The proposed method is systematically evaluated on the CIFAR-10, NUS-WIDE and MS-COCO datasets, and the results show that our method significantly outperforms the state-of-the-art single-view and multi-view hashing methods.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available