☆ 4.7 Article

HandyPose: Multi-level framework for hand pose estimation

PATTERN RECOGNITION (2022)

期刊

PATTERN RECOGNITION

卷 128, 期 -, 页码 -

出版社

ELSEVIER SCI LTD

DOI: 10.1016/j.patcog.2022.108674

关键词

Hand pose estimation; Feature representations; Computer vision

类别

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic

资金

National Science Foundation [1749376]
Division Of Behavioral and Cognitive Sci
Direct For Social, Behav & Economic Scie [1749376] Funding Source: National Science Foundation

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper presents HandyPose, a single-pass, end-to-end trainable architecture for 2D hand pose estimation using a single RGB image as input. The proposed method achieves high accuracy while maintaining manageable size complexity and modularity of the network. The advanced multi-level waterfall module and multi-scale approach contribute to the performance improvement. The results demonstrate that HandyPose is a robust and efficient architecture for 2D hand pose estimation.

Hand pose estimation is a challenging task due to the large number of degrees of freedom and the frequent occlusions of joints. To address these challenges, we propose HandyPose, a single-pass, end -to-end trainable architecture for 2D hand pose estimation using a single RGB image as input. Adopt-ing an encoder-decoder framework with multi-level features, along with a novel multi-level waterfall atrous spatial pooling module for multi-scale representations, our method achieves high accuracy in hand pose while maintaining manageable size complexity and modularity of the network. HandyPose takes a multi-scale approach to representing context by incorporating spatial information at various levels of the network to mitigate the loss of resolution due to pooling. Our advanced multi-level waterfall module leverages the efficiency of progressive cascade filtering while maintaining larger fields-of-view through the concatenation of multi-level features from different levels of the network in the waterfall module. The decoder incorporates both the waterfall and multi-scale features for the generation of accurate joint heatmaps in a single stage. Our results demonstrate state-of-the-art performance on popular datasets and show that HandyPose is a robust and efficient architecture for 2D hand pose estimation.(c) 2022 Elsevier Ltd. All rights reserved.

HandyPose: Multi-level framework for hand pose estimation

期刊

PATTERN RECOGNITION

出版社

ELSEVIER SCI LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

HandyPose: Multi-level framework for hand pose estimation

期刊

PATTERN RECOGNITION

出版社

ELSEVIER SCI LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文