4.6 Article

Pose Refinement Graph Convolutional Network for Skeleton-Based Action Recognition

期刊

IEEE ROBOTICS AND AUTOMATION LETTERS
卷 6, 期 2, 页码 1028-1035

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/LRA.2021.3056361

关键词

Deep learning for visual perception; recognition

类别

资金

  1. Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany's Excellence Strategy [EXC 2070 390732324, GA1927/4-2, FOR 2535]
  2. ERC Starting Grant ARCA [677650]

向作者/读者索取更多资源

This study introduces a highly efficient graph convolutional network that addresses the limitations of previous works by gradually fusing motion and spatial information through a parallel structure and reducing the temporal resolution early on. By refining poses before processing and explicitly addressing potential errors, the network achieves a better balance between accuracy, memory usage, and processing time.
With the advances in capturing 2D or 3D skeleton data, skeleton-based action recognition has received an increasing interest over the last years. As skeleton data is commonly represented by graphs, graph convolutional networks have been proposed for this task. While current graph convolutional networks accurately recognize actions, they are too expensive for robotics applications where limited computational resources are available. In this letter, we therefore propose a highly efficient graph convolutional network that addresses the limitations of previous works. This is achieved by a parallel structure that gradually fuses motion and spatial information and by reducing the temporal resolution as early as possible. Furthermore, we explicitly address the issue that human poses can contain errors. To this end, the network first refines the poses before they are further processed to recognize the action. We therefore call the network Pose Refinement Graph Convolutional Network. Compared to other graph convolutional networks, our network requires 86%-93% less parameters and reduces the floating point operations by 89%-96% while achieving a comparable accuracy. It therefore provides a much better trade-off between accuracy, memory footprint and processing time, which makes it suitable for robotics applications.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据