4.7 Article

An Improved Single Shot Multibox for Video-Rate Head Pose Prediction

期刊

IEEE SENSORS JOURNAL
卷 20, 期 20, 页码 12326-12333

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JSEN.2020.2999625

关键词

Representation of Euler angles; visualization of head pose; SSD-like pose detection; driver attention monitoring

资金

  1. Department of Science and Technology of Jiangsu Province, China [BE2018002-2, BE2018002-3]

向作者/读者索取更多资源

Head pose estimation plays a crucial role in attention detection, behavior analysis, human-computer interaction, and eye tracking, etc. The existing landmark-to-pose methods require two steps, not only to detect the key points of the human face, but also to solve the 2D to 3D mapping problem usually by the average head model. We propose an efficient and robust method to monitor the driver's attention. Firstly, the prevailing object detection algorithm SSD which has inherent capabilities of simultaneous classify and regress, is used to create a lightweight network, which avoids the shortcomings of high coupling and time-consuming of the existing methods. Then, single-scale anchors, which have a less computational cost than multi-scale anchors, are adopted for vehicle environments where the ambient light changes dramatically. Finally, by binning continuous angles into specific classes, the 3D angle regression problem is converted into angle classification and face box regression, and our model directly outputs Euler angles (Yaw, Pitch, and Roll) without detecting face landmarks. Experiments on YawDD result that our approach can efficiently perform detection tasks and estimation tasks under the actual driving environment of various luminosity. The mean average errors of prediction in AFLW2000 and 300W-LP are 6.01 degrees and 2.38 degrees, which demonstrates the accuracy of the proposed algorithm.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据