☆ 4.6 Article

View Enhanced Jigsaw Puzzle for Self-Supervised Feature Learning in 3D Human Action Recognition

IEEE ACCESS (2022)

期刊

IEEE ACCESS

卷 10, 期 -, 页码 36385-36396

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/ACCESS.2022.3165040

关键词

Task analysis; Skeleton; Feature extraction; Training; Representation learning; Three-dimensional displays; Image recognition; Action recognition; self-supervised learning; multi-view; pretext task; human skeleton; gate recurrent unit

类别

Computer Science, Information Systems Engineering, Electrical & Electronic Telecommunications

资金

National Key Research and Development Program of China [2018YFB2003200, 2018YFB2003500]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

A novel self-supervised learning method is proposed in this study, which introduces the view enhanced jigsaw puzzle (VEJP) pretext task and the view pooling encoder (VPE) to improve feature learning. Experimental results show that moderately difficult pretext tasks can effectively enhance feature learning.

Self-supervised learning methods have received much attention in skeleton-based human action recognition. These methods rely on pretext tasks to utilize unlabeled data and learn an effective feature encoder. In this paper, a novel self-supervised learning method is proposed. First, we design a new pretext task called view enhanced jigsaw puzzle (VEJP) to improve the learning difficulty of the encoder. The VEJP introduces multi-view information into the jigsaw puzzle, thus forcing the encoder to learn view-independent high-level features of human skeletons. Based on the encoder trained by VEJP, we propose the view pooling encoder (VPE) to integrate the information of multiple views with the pooling mechanism, and the features extracted by VPE are more robust and distinguishable. In addition, by adjusting the difficulty of VEJP, the influence of the pretext task difficulty on the downstream task performance is studied, and the experimental results show that the pretext tasks should be moderately difficult to achieve effective feature learning. Our method achieves competitive results on representative benchmark datasets. It provides a strong baseline for the jigsaw puzzle task and shows advantages in situations where the number of labeled data is minimal.

View Enhanced Jigsaw Puzzle for Self-Supervised Feature Learning in 3D Human Action Recognition

期刊

IEEE ACCESS

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

View Enhanced Jigsaw Puzzle for Self-Supervised Feature Learning in 3D Human Action Recognition

期刊

IEEE ACCESS

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文