4.7 Article

Attention-guided and fine-grained feature extraction from face images for gaze estimation

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.engappai.2023.106994

关键词

Gaze estimation; Attention mechanism; TNT; Coarse; and fine-grained

向作者/读者索取更多资源

Considerable results have been achieved in research on appearance gaze estimation based on deep learning. However, the precision of gaze estimation is reduced due to a lack of information in the extracted local blocks, as CNNs do not prioritize the information within the key picture blocks for face image estimation. In this study, fine-grained visual information is extracted from local blocks using a Transformer in Transformer (TNT) model that emphasizes interactions between local blocks.
Research on appearance gaze estimation based on deep learning has achieved considerable results. However, a lack of information in the extracted local blocks reduces the precision of gaze estimation since CNNs do not prioritize the information within the key picture blocks for face image estimation. In this research, fine-grained visual information is extracted from local blocks using a Transformer in Transformer (TNT) model that emphasizes the interactions between local blocks. First, TNT-based gaze estimation models (GTiT-Pure and GTiTHybrid) are established with the TNT model to capture both coarse- and fine-grained visual information about the face, which are then combined to provide a feature representation for gaze regression. Then, the performance of the two models are evaluated on four gaze estimation datasets. Experimental results demonstrate that the pretrained GTiT-Pure has less gaze estimation error than the majority of CNNs and Transformer models, and that the GTiT-Hybrid model performs the best on the EyeDiap and RT-Gene datasets. The GTiT model which combines both coarse- and fine-grained gaze features in face images, can be used to further explore the advantages of the Transformer model in gaze feature extraction.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据