4.8 Article

Learning Multi-View Interactional Skeleton Graph for Action Recognition

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TPAMI.2020.3032738

关键词

Skeleton; Topology; Feature extraction; Convolution; Network topology; Recurrent neural networks; Action recognition; skeleton; multi-view; graph neural network; hierarchical method

向作者/读者索取更多资源

The paper proposes a multi-view interactional graph network (MV-IGNet) that can construct, learn, and infer multi-level spatial skeleton contexts. MV-IGNet utilizes different skeleton topologies as multi-views to generate complementary action features. Compared to mainstream methods, MV-IGNet has a smaller model size and faster inference.
Capturing the interactions of human articulations lies in the center of skeleton-based action recognition. Recent graph-based methods are inherently limited in the weak spatial context modeling capability due to fixed interaction pattern and inflexible shared weights of GCN. To address above problems, we propose the multi-view interactional graph network (MV-IGNet) which can construct, learn and infer multi-level spatial skeleton context, including view-level (global), group-level, joint-level (local) context, in a unified way. MV-IGNet leverages different skeleton topologies as multi-views to cooperatively generate complementary action features. For each view, separable parametric graph convolution (SPG-Conv) enables multiple parameterized graphs to enrich local interaction patterns, which provides strong graph-adaption ability to handle irregular skeleton topologies. We also partition the skeleton into several groups and then the higher-level group contexts including inter-group and intra-group, are hierarchically captured by above SPG-Conv layers. A simple yet effective global context adaption (GCA) module facilitates representative feature extraction by learning the input-dependent skeleton topologies. Compared to the mainstream works, MV-IGNet can be readily implemented while with smaller model size and faster inference. Experimental results show the proposed MV-IGNet achieves impressive performance on large-scale benchmarks: NTU-RGB+D and NTU-RGB+D 120.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据