Article

HandyPose: Multi-level framework for hand pose estimation

Journal

PATTERN RECOGNITION
Volume 128

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.patcog.2022.108674

Keywords

Hand pose estimation; Feature representations; Computer vision

Funding

  1. National Science Foundation [1749376]
  2. Division Of Behavioral and Cognitive Sci
  3. Direct For Social, Behav & Economic Scie [1749376] Funding Source: National Science Foundation

This paper presents HandyPose, a single-pass, end-to-end trainable architecture for 2D hand pose estimation using a single RGB image as input. The proposed method achieves high accuracy while maintaining manageable size complexity and modularity of the network. The advanced multi-level waterfall module and multi-scale approach contribute to the performance improvement. The results demonstrate that HandyPose is a robust and efficient architecture for 2D hand pose estimation.
Hand pose estimation is a challenging task due to the large number of degrees of freedom and the frequent occlusions of joints. To address these challenges, we propose HandyPose, a single-pass, end-to-end trainable architecture for 2D hand pose estimation using a single RGB image as input. Adopting an encoder-decoder framework with multi-level features, along with a novel multi-level waterfall atrous spatial pooling module for multi-scale representations, our method achieves high accuracy in hand pose while maintaining manageable size complexity and modularity of the network. HandyPose takes a multi-scale approach to representing context by incorporating spatial information at various levels of the network to mitigate the loss of resolution due to pooling. Our advanced multi-level waterfall module leverages the efficiency of progressive cascade filtering while maintaining larger fields-of-view through the concatenation of multi-level features from different levels of the network in the waterfall module. The decoder incorporates both the waterfall and multi-scale features for the generation of accurate joint heatmaps in a single stage. Our results demonstrate state-of-the-art performance on popular datasets and show that HandyPose is a robust and efficient architecture for 2D hand pose estimation. © 2022 Elsevier Ltd. All rights reserved.
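
The progressive cascade filtering mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it is reduced to 1D, uses a fixed all-ones kernel, and the dilation rates are assumptions chosen only for illustration. The function names (`dilated_conv1d`, `waterfall`, `receptive_field`) are hypothetical. The multi-level backbone features that HandyPose concatenates into the module are omitted. The sketch shows only how feeding each atrous branch with the previous branch's output widens the field of view while every branch output is kept for later concatenation.

```python
from typing import List, Sequence


def dilated_conv1d(signal: Sequence[float], kernel: Sequence[float],
                   dilation: int) -> List[float]:
    """Same-length 1D atrous (dilated) convolution with zero padding."""
    centre = len(kernel) // 2
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for j, weight in enumerate(kernel):
            # Taps are spaced `dilation` samples apart around position i.
            idx = i + (j - centre) * dilation
            if 0 <= idx < len(signal):
                acc += weight * signal[idx]
        out.append(acc)
    return out


def waterfall(signal: Sequence[float], kernel: Sequence[float],
              dilations: Sequence[int]) -> List[List[float]]:
    """Cascade of atrous branches: each branch filters the previous
    branch's output, and all branch outputs are returned so they can
    be concatenated afterwards."""
    branches = []
    current = list(signal)
    for d in dilations:
        current = dilated_conv1d(current, kernel, d)
        branches.append(current)
    return branches


def receptive_field(kernel_size: int, dilations: Sequence[int]) -> List[int]:
    """Receptive field (in samples) after each stage of the cascade."""
    field = 1
    fields = []
    for d in dilations:
        field += (kernel_size - 1) * d
        fields.append(field)
    return fields
```

With a 3-tap kernel and assumed dilation rates of 1, 2, and 4, `receptive_field(3, [1, 2, 4])` gives 3, 7, and 15 samples for the successive branches. Feeding a unit impulse through `waterfall` confirms this, since the nonzero span of each branch output matches those widths.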