期刊
VISUAL COMPUTER
卷 -, 期 -, 页码 -出版社
SPRINGER
DOI: 10.1007/s00371-023-03121-4
关键词
Cross-resolution face recognition; Convolutional neural network; Multi-resolution feature fusion; Curriculum learning; Surveillance systems
Face recognition in surveillance is challenging due to the disparity between low-resolution camera images and high-resolution database images. This paper proposes a two-stage dual-resolution face network to learn resolution-invariant representations. The network is pre-trained using high-resolution images and then fine-tuned using the triplet loss and competence-based curriculum learning. Experimental results show remarkable face verification accuracy.
Face recognition for surveillance remains a complex challenge due to the disparity between low-resolution (LR) face images captured by surveillance cameras and the typically high-resolution (HR) face images in databases. To address this cross-resolution face recognition problem, we propose a two-stage dual-resolution face network to learn more robust resolution-invariant representations. In the first stage, we pre-train the proposed dual-resolution face network using solely HR images. Our network utilizes a two-branch structure and introduces bilateral connections to fuse the high- and low-resolution features extracted by two branches, respectively. In the second stage, we introduce the triplet loss as the fine-tuning loss function and design a training strategy that combines the triplet loss with competence-based curriculum learning. According to the competence function, the pre-trained model can train first from easy sample sets and gradually progress to more challenging ones. Our method achieves a remarkable face verification accuracy of 99.25% on the native cross-quality dataset SCFace and 99.71% on the high-quality dataset LFW. Moreover, our method also enhances the face verification accuracy on the native low-quality dataset.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据