4.7 Article

Composite recurrent network with internal denoising for facial alignment in still and video images in the wild

期刊

IMAGE AND VISION COMPUTING
卷 111, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.imavis.2021.104189

关键词

Facial alignment; Facial tracking; Temporal modeling; Internal denoising

资金

  1. Spanish Ministry of Economy and Competitiveness [TIN2017-90124-P]
  2. Ramon y Cajal Programme
  3. Maria de Maeztu Units of Excellence Programme [MDM-2015-0502]

向作者/读者索取更多资源

Facial alignment is crucial for various facial analysis applications, but current methods may struggle in highly unconstrained conditions. This paper introduces a composite recurrent tracker with internal denoising to address single image facial alignment and deformable facial tracking, achieving accurate and robust performance in challenging settings, as demonstrated through testing against state-of-the-art methods.
Facial alignment is an essential task for many higher level facial analysis applications, such as animation, human activity recognition and human -computer interaction. Although the recent availability of big datasets and powerful deep-learning approaches have enabled major improvements on the state of the art accuracy, the performance of current approaches can severely deteriorate when dealing with images in highly unconstrained conditions, which limits the real-life applicability of such models. In this paper, we propose a composite recurrent tracker with internal denoising that jointly address both single image facial alignment and deformable facial tracking in the wild. Specifically, we incorporate multilayer LSTMs to model temporal dependencies with variable length and introduce an internal denoiser which selectively enhances the input images to improve the robustness of our overall model. We achieve this by combining 4 different sub-networks that specialize in each of the key tasks that are required, namely face detection, bounding-box tracking, facial region validation and facial alignment with internal denoising. These blocks are endowed with novel algorithms resulting in a facial tracker that is both accurate, robust to in-the-wild settings and resilient against drifting. We demonstrate this by testing our model on 300-W and Menpo datasets for single image facial alignment, and 300-VW dataset for deformable facial tracking. Comparison against 20 other state of the art methods demonstrates the excellent performance of the proposed approach. (c) 2021 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http:// creativecommons.org/licenses/by/4.0/).

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据