☆ 4.8 Article

UniPose plus : A Unified Framework for 2D and 3D Human Pose Estimation in Images and Videos

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

期刊

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE

卷 44, 期 12, 页码 9641-9653

出版社

IEEE COMPUTER SOC

DOI: 10.1109/TPAMI.2021.3124736

关键词

Three-dimensional displays; Pose estimation; Videos; Computer architecture; Task analysis; Decoding; Biological system modeling; Human pose estimation; 3D human pose estimation; computer vision; deep learning

类别

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic

资金

National Science Foundation [1749376]
Direct For Social, Behav & Economic Scie
Division Of Behavioral and Cognitive Sci [1749376] Funding Source: National Science Foundation

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

We propose UniPose+, a unified framework for 2D and 3D human pose estimation in images and videos. The framework leverages multi-scale feature representations and achieves pose estimation without increasing network size and postprocessing. Our results demonstrate that UniPose+ is a robust and efficient architecture.

We propose UniPose+, a unified framework for 2D and 3D human pose estimation in images and videos. The UniPose+ architecture leverages multi-scale feature representations to increase the effectiveness of backbone feature extractors, with no significant increase in network size and no postprocessing. Current pose estimation methods heavily rely on statistical postprocessing or predefined anchor poses for joint localization. The UniPose+ framework incorporates contextual information across scales and joint localization with Gaussian heatmap modulation at the decoder output to estimate 2D and 3D human pose in a single stage with state-of-the-art accuracy, without relying on predefined anchor poses. The multi-scale representations allowed by the waterfall module in the UniPose+ framework leverage the efficiency of progressive filtering in the cascade architecture, while maintaining multi-scale fields-of-view comparable to spatial pyramid configurations. Our results on multiple datasets demonstrate that UniPose+, with a HRNet, ResNet or SENet backbone and waterfall module, is a robust and efficient architecture for single person 2D and 3D pose estimation in single images and videos.

UniPose plus : A Unified Framework for 2D and 3D Human Pose Estimation in Images and Videos

期刊

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE

出版社

IEEE COMPUTER SOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

UniPose plus : A Unified Framework for 2D and 3D Human Pose Estimation in Images and Videos

期刊

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE

出版社

IEEE COMPUTER SOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文