4.8 Article

ARHPE: Asymmetric Relation-Aware Representation Learning for Head Pose Estimation in Industrial Human-Computer Interaction

Journal

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS
Volume 18, Issue 10, Pages 7107-7117

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TII.2022.3143605

Keywords

Head; Magnetic heads; Pose estimation; Task analysis; Informatics; Training; Feature extraction; Asymmetric relation; head pose estimation (HPE); human-computer interaction; label learning; online education

Funding

  1. National Natural Science Foundation of China [62177018, 62173286, 62011530436, 62077020, 62005092, 61875068]
  2. China Unicom Hubei Branch Bilateral Cooperation Research Funds [2021111002002004]
  3. Universities Helping Counties Research Funds of Hubei Province [BXLBX0192]

Ask authors/readers for more resources

This article addresses two key problems in head pose estimation - better prediction performance and using adjacent poses information. It proposes a method using label learning and asymmetric relation cues to improve the accuracy of head pose estimation.
Head pose estimation (HPE) has wide industrial applications, such as online education, human-robot interaction, and automatic manufacturing. In this article, we address two key problems in HPE based on label learning and asymmetric relation cues: 1) how to bridge the gap between the better prediction performance of networks and incorrectly label pose images in the HPE datasets and 2) how to take full advantage of the adjacent poses information around the centered pose image. We reconstruct all the incorrect labels as a two-dimensional Lorentz distribution to tackle the first problem. Instead of directly adopting the angle values as hard labels, we assign part of the probability values (soft labels) to adjacent labels for learning discriminative feature representations. To address the second problem, we reveal the asymmetric relation nature of HPE datasets. The yaw direction and pitch direction are assigned different weights by introducing the half at half-maximum of the Lorentz distribution. Compared with the traditional end-to-end frameworks, the proposed one can leverage the asymmetric relation cues for predicting the head pose angle in the incorrect label scenarios. Extensive experiments on two public datasets and our infrared dataset demonstrate that the proposed ARHPE network significantly outperforms other state-of-the-art approaches.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available