4.6 Article

NGDNet: Nonuniform Gaussian-label distribution learning for infrared head pose estimation and on-task behavior understanding in the classroom

期刊

NEUROCOMPUTING
卷 436, 期 -, 页码 210-220

出版社

ELSEVIER
DOI: 10.1016/j.neucom.2020.12.090

关键词

Head pose estimation; Deep learning; On-task behavior understanding; Infrared imaging; Soft label; Human?computer interaction; K-12 education

资金

  1. National Natural Science Foundation of China [62005092]
  2. China?s ministry level project [MCM20160409]
  3. National Science Foundation [18YJA880090]

向作者/读者索取更多资源

The paper introduces a novel nonuniform Gaussian-label distribution learning network (NGDNet) for head pose estimation under active infrared illumination, which improves accuracy by revealing the essential properties of head pose images and determining distribution size based on similarity to adjacent hand poses. Experimental results show that the model achieves outstanding performance on several datasets.
Head pose estimation (HPE) under active infrared (IR) illumination has attracted much attention in the fields of computer vision and machine learning. However, IRHPE often suffers from the problems of low-quality IR images and ambiguous head pose. To tackle these issues, we propose a novel nonuniform Gaussian-label distribution learning network (NGDNet) for the HPE task. First, we reveal the essential properties from two different perspectives: 1) two head pose images change differently in pitch and yaw directions with the same angle increasing on the central pose; 2) the IR head pose variation first increases and then decreases in the pitch direction. Subsequently, the first property indicates the pose image label as a nonuniform label distribution (Gaussian function) with different long and short axes. The second property is leveraged to determine the distribution size in accordance with the similarities of adjacent hand poses. Lastly, the proposed NGDNet is verified on a new IRHPE dataset, which is built by our research group. Experimental results on several datasets demonstrate the effectiveness of the proposed model. Compared with conventional algorithms, our NGDNet model achieves state-of-the-art performance with 77.39% on IRHPE, 99.08% on CAS-PEAL-R1, and 87.41% on Pointing'04. Our code is publicly available at https://github.com/TingtingSL/NGDNet. (c) 2021 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据