4.7 Article

Hybrid Refinement-Correction Heatmaps for Human Pose Estimation

期刊

IEEE TRANSACTIONS ON MULTIMEDIA
卷 23, 期 -, 页码 1330-1342

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TMM.2020.2999181

关键词

Pose estimation; Heating systems; Feature extraction; Predictive models; Convolutional neural networks; Detectors; Computer vision; Human pose estimation; pose refinement; pose correction; heatmaps fusion

资金

  1. National Key Research and Development Program of China [2018YFF0300903]
  2. Science and Technology Commission of ShanghaiMunicipality [18410750700, 17411952600, 16DZ0501100, 61872241, 61572316]
  3. Hong Kong Polytechnic University [P0030419, P0030929]

向作者/读者索取更多资源

In this paper, we propose a method called Hybrid-Pose to improve human pose estimation in images. We design two convolutional neural network models using Stacked Hourglass Networks, one for pose refinement and one for pose correction.
In this paper, we present a method (Hybrid-Pose) to improve human pose estimation in images. We adopt Stacked Hourglass Networks to design two convolutional neural network models, RNet for pose refinement and CNet for pose correction. The CNet (Correction Network) guides the pose refinement RNet (Refinement Network) to correct the joint location before generating the final pose. Each of the two models is composed of four hourglasses, and each hourglass generates a group of detection heatmaps for the joints. The RNet model hourglasses have the same structure. However, the CNet model is designed with hourglasses of different structures for pose guidance. Since the pose estimation in RGB images is very sensitive to the image scene, our proposed approach generates multiple outputs of detection heatmaps to broaden the searching scope for the correct joints locations. We use the RNet model to refine the joints locations in each hourglass stage horizontally, then the heatmaps of each stage are fused with the heatmaps of all the CNet model hourglasses vertically in a hybrid manner. Our method shows competitive results with the existing state-of-the-art approaches on MPII and FLIC benchmark datasets. Although our proposed method focuses on improving single-person pose estimation, we also show the influence of this improvement on multi-person pose estimation by detecting multiple people using SSD detector, then estimating the pose of each person individually.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据