4.6 Article

Attentional weighting strategy-based dynamic GCN for skeleton-based action recognition

Journal

MULTIMEDIA SYSTEMS
Volume 29, Issue 4, Pages 1941-1954

Publisher

SPRINGER
DOI: 10.1007/s00530-023-01082-1

Keywords

Skeleton-based action recognition; Graph topology; Position feature

Ask authors/readers for more resources

This paper proposes a novel attentional weighting strategy-based dynamic GCN (AWD-GCN) to extract discriminative action features by capturing the dynamic relationships among the three partitions of the human skeleton. The method uses a new dynamic adjacency matrix and attention weighting mechanism. Multi-scale position attention and multi-level attention are also proposed to differentiate human action in different spatial scales. Experimental results on challenging datasets demonstrate the effectiveness and superiority of the proposed method.
Graph Convolutional Networks (GCNs) have become the standard skeleton-based human action recognition research paradigm. As a core component in graph convolutional networks, the construction of graph topology often significantly impacts the accuracy of classification. Considering that the fixed physical graph topology cannot capture the non-physical connection relationship of the human body, existing methods capture more flexible node relationships by constructing dynamic graph structures. This paper proposes a novel attentional weighting strategy-based dynamic GCN (AWD-GCN). We construct a new dynamic adjacency matrix, which uses the attention weighting mechanism to simultaneously capture the dynamic relationships among the three partitions of the human skeleton under multiple actions to extract the discriminative action features fully. In addition, considering the importance of skeletal node position features for action differentiation, we propose new multi-scale position attention and multi-level attention. We use a multi-scale modelling method to capture the complex relationship between skeletal node position features, which is helpful in distinguishing human action in different spatial scales. Extensive experiments on two challenging datasets, NTU-RGB+D and Skeleton-Kinetics, demonstrate the effectiveness and superiority of our method.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available