4.4 Article

End-to-end global to local convolutional neural network learning for hand pose recovery in depth data

期刊

IET COMPUTER VISION
卷 16, 期 1, 页码 50-66

出版社

WILEY
DOI: 10.1049/cvi2.12064

关键词

computer vision; data acquisition; human computer interaction; learning (artificial intelligence); pose estimation

资金

  1. MINECO/FEDER [PID2019-105093GB-I00, PID2020-120611RB-I00, RTI2018-095232-B-C22, TIN2015-65464-R]
  2. CERCA Programme/Generalitat de Catalunya
  3. ICREA

向作者/读者索取更多资源

This study introduces a novel hierarchical tree-like structured CNN to address the 3D pose estimation of human hands, training branches to specialize in local poses and fusing features to learn higher order dependencies among joints. Furthermore, a non-rigid data augmentation approach is employed to increase training depth data. Experimental results show competitive performance on various datasets.
Despite recent advances in 3-D pose estimation of human hands, thanks to the advent of convolutional neural networks (CNNs) and depth cameras, this task is still far from being solved in uncontrolled setups. This is mainly due to the highly non-linear dynamics of fingers and self-occlusions, which make hand model training a challenging task. In this study, a novel hierarchical tree-like structured CNN is exploited, in which branches are trained to become specialised in predefined subsets of hand joints called local poses. Further, local pose features, extracted from hierarchical CNN branches, are fused to learn higher order dependencies among joints in the final pose by end-to-end training. Lastly, the loss function used is also defined to incorporate appearance and physical constraints about doable hand motions and deformations. Finally, a non-rigid data augmentation approach is introduced to increase the amount of training depth data. Experimental results suggest that feeding a tree-shaped CNN, specialised in local poses, into a fusion network for modelling joints' correlations and dependencies, helps to increase the precision of final estimations, showing competitive results on NYU, MSRA, Hands17 and SyntheticHand datasets.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据