☆ 4.7 Article

Heterogeneous Face Interpretable Disentangled Representation for Joint Face Recognition and Synthesis

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

期刊

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS

卷 33, 期 10, 页码 5611-5625

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TNNLS.2021.3071119

关键词

Face recognition; Task analysis; Feature extraction; Image recognition; Sensors; Semantics; Visualization; Disentanglement; heterogeneous face; interpretable representation; joint recognition and synthesis

类别

Computer Science, Artificial Intelligence Computer Science, Hardware & Architecture Computer Science, Theory & Methods Engineering, Electrical & Electronic

资金

National Key Research and Development Program of China [2018AAA0103202]
National Natural Science Foundation of China [62072356, 62050175, 62036007, 61922066, 61876142, 61806152, 61772402]
Xidian University Intellifusion Joint Innovation Laboratory of Artificial Intelligence
Fundamental Research Funds for the Central Universities
China Post-Doctoral Science Foundation [2018M631124, 2019T120880]
Key Research and Development Program of Shaanxi [2020ZDLGY08-08]
Guangxi Natural Science Foundation Program [2021GXNSFDA075011]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The article explores learning interpretable representations for complex heterogeneous faces and proposes the HFIDR and M-HFIDR methods for cross-modality recognition and synthesis tasks, achieving efficiency in face recognition and synthesis.

Heterogeneous faces are acquired with different sensors, which are closer to real-world scenarios and play an important role in the biometric security field. However, heterogeneous face analysis is still a challenging problem due to the large discrepancy between different modalities. Recent works either focus on designing a novel loss function or network architecture to directly extract modality-invariant features or synthesizing the same modality faces initially to decrease the modality gap. Yet, the former always lacks explicit interpretability, and the latter strategy inherently brings in synthesis bias. In this article, we explore to learn the plain interpretable representation for complex heterogeneous faces and simultaneously perform face recognition and synthesis tasks. We propose the heterogeneous face interpretable disentangled representation (HFIDR) that could explicitly interpret dimensions of face representation rather than simple mapping. Benefited from the interpretable structure, we further could extract latent identity information for cross-modality recognition and convert the modality factor to synthesize cross-modality faces. Moreover, we propose a multimodality heterogeneous face interpretable disentangled representation (M-HFIDR) to extend the basic approach suitable for the multimodality face recognition and synthesis. To evaluate the ability of generalization, we construct a novel large-scale face sketch data set. Experimental results on multiple heterogeneous face databases demonstrate the effectiveness of the proposed method.

Heterogeneous Face Interpretable Disentangled Representation for Joint Face Recognition and Synthesis

期刊

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Heterogeneous Face Interpretable Disentangled Representation for Joint Face Recognition and Synthesis

期刊

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文