期刊
PATTERN RECOGNITION
卷 128, 期 -, 页码 -出版社
ELSEVIER SCI LTD
DOI: 10.1016/j.patcog.2022.108674
关键词
Hand pose estimation; Feature representations; Computer vision
资金
- National Science Foundation [1749376]
- Division Of Behavioral and Cognitive Sci
- Direct For Social, Behav & Economic Scie [1749376] Funding Source: National Science Foundation
This paper presents HandyPose, a single-pass, end-to-end trainable architecture for 2D hand pose estimation using a single RGB image as input. The proposed method achieves high accuracy while maintaining manageable size complexity and modularity of the network. The advanced multi-level waterfall module and multi-scale approach contribute to the performance improvement. The results demonstrate that HandyPose is a robust and efficient architecture for 2D hand pose estimation.
Hand pose estimation is a challenging task due to the large number of degrees of freedom and the frequent occlusions of joints. To address these challenges, we propose HandyPose, a single-pass, end -to-end trainable architecture for 2D hand pose estimation using a single RGB image as input. Adopt-ing an encoder-decoder framework with multi-level features, along with a novel multi-level waterfall atrous spatial pooling module for multi-scale representations, our method achieves high accuracy in hand pose while maintaining manageable size complexity and modularity of the network. HandyPose takes a multi-scale approach to representing context by incorporating spatial information at various levels of the network to mitigate the loss of resolution due to pooling. Our advanced multi-level waterfall module leverages the efficiency of progressive cascade filtering while maintaining larger fields-of-view through the concatenation of multi-level features from different levels of the network in the waterfall module. The decoder incorporates both the waterfall and multi-scale features for the generation of accurate joint heatmaps in a single stage. Our results demonstrate state-of-the-art performance on popular datasets and show that HandyPose is a robust and efficient architecture for 2D hand pose estimation.(c) 2022 Elsevier Ltd. All rights reserved.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据