4.7 Article

Deep CNN based binary hash video representations for face retrieval

Journal

PATTERN RECOGNITION
Volume 81, Issue -, Pages 357-369

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.patcog.2018.04.014

Keywords

Face video retrieval; Cross-domain face retrieval; Deep CNN; Hash learning

Funding

  1. Natural Science Foundation of China (NSFC) [61472038, 61375044]

Ask authors/readers for more resources

In this paper, a novel deep convolutional neural network is proposed to learn discriminative binary hash video representations for face retrieval. The network integrates face feature extractor and hash functions into a unified optimization framework to make the two components be as compatible as possible. In order to achieve better initializations for the optimization, the low-rank discriminative binary hashing method is introduced to pre-learn the hash functions of the network during the training procedure. The input to the network is a face frame, and the output is the corresponding binary hash frame representation. Frame representations of a face video shot are fused by hard voting to generate the binary hash video representation. Each bit in the binary representation of frame/video describes the presence or absence of a face attribute, which makes it possible to retrieve faces among both the image and video domains. Extensive experiments are conducted on two challenging TV-Series datasets, and the excellent performance demonstrates the effectiveness of the proposed network. (C) 2018 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available