期刊
出版社
IEEE
DOI: 10.1109/IJCNN52387.2021.9534440
关键词
-
类别
资金
- European Union's Horizon 2020 Research and Innovation Action Program [871449]
Graph convolutional networks have shown promising results in skeleton-based human action recognition by modeling skeletons as a spatio-temporal graph, with recent methods focusing on learning the graph structure using spatial attention. This paper proposes symmetric spatial attention to better capture the symmetric property of human body joints during actions, and introduces the spatio-temporal bilinear network (ST-BLN) as a more flexible alternative to predefined adjacency matrices. Experimental results demonstrate that all three models perform equally well, with the ST-BLN offering increased efficiency.
Graph convolutional networks (GCNs) achieved promising performance in skeleton-based human action recognition by modeling a sequence of skeletons as a spatio-temporal graph. Most of the recently proposed GCN-based methods improve the performance by learning the graph structure at each layer of the network using spatial attention applied on a predefined graph Adjacency matrix that is optimized jointly with model's parameters in an end-to-end manner. In this paper, we analyze the spatial attention used in spatio-temporal GCN layers and propose a symmetric spatial attention for better reflecting the symmetric property of the relative positions of the human body joints when executing actions. We also highlight the connection of spatio-temporal GCN layers employing additive spatial attention to bilinear layers, and we propose the spatio-temporal bilinear network (ST-BLN) which does not require the use of predefined Adjacency matrices and allows for more flexible design of the model. Experimental results show that the three models lead to effectively the same performance. Moreover, by exploiting the flexibility provided by the proposed ST-BLN, one can increase the efficiency of the model.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据