4.7 Article

Heterogeneous Face Interpretable Disentangled Representation for Joint Face Recognition and Synthesis

期刊

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNNLS.2021.3071119

关键词

Face recognition; Task analysis; Feature extraction; Image recognition; Sensors; Semantics; Visualization; Disentanglement; heterogeneous face; interpretable representation; joint recognition and synthesis

资金

  1. National Key Research and Development Program of China [2018AAA0103202]
  2. National Natural Science Foundation of China [62072356, 62050175, 62036007, 61922066, 61876142, 61806152, 61772402]
  3. Xidian University Intellifusion Joint Innovation Laboratory of Artificial Intelligence
  4. Fundamental Research Funds for the Central Universities
  5. China Post-Doctoral Science Foundation [2018M631124, 2019T120880]
  6. Key Research and Development Program of Shaanxi [2020ZDLGY08-08]
  7. Guangxi Natural Science Foundation Program [2021GXNSFDA075011]

向作者/读者索取更多资源

The article explores learning interpretable representations for complex heterogeneous faces and proposes the HFIDR and M-HFIDR methods for cross-modality recognition and synthesis tasks, achieving efficiency in face recognition and synthesis.
Heterogeneous faces are acquired with different sensors, which are closer to real-world scenarios and play an important role in the biometric security field. However, heterogeneous face analysis is still a challenging problem due to the large discrepancy between different modalities. Recent works either focus on designing a novel loss function or network architecture to directly extract modality-invariant features or synthesizing the same modality faces initially to decrease the modality gap. Yet, the former always lacks explicit interpretability, and the latter strategy inherently brings in synthesis bias. In this article, we explore to learn the plain interpretable representation for complex heterogeneous faces and simultaneously perform face recognition and synthesis tasks. We propose the heterogeneous face interpretable disentangled representation (HFIDR) that could explicitly interpret dimensions of face representation rather than simple mapping. Benefited from the interpretable structure, we further could extract latent identity information for cross-modality recognition and convert the modality factor to synthesize cross-modality faces. Moreover, we propose a multimodality heterogeneous face interpretable disentangled representation (M-HFIDR) to extend the basic approach suitable for the multimodality face recognition and synthesis. To evaluate the ability of generalization, we construct a novel large-scale face sketch data set. Experimental results on multiple heterogeneous face databases demonstrate the effectiveness of the proposed method.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据