4.7 Article

Inertial Sensor Data to Image Encoding for Human Action Recognition

期刊

IEEE SENSORS JOURNAL
卷 21, 期 9, 页码 10978-10988

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JSEN.2021.3062261

关键词

Support vector machines; Image recognition; Correlation; Inertial sensors; Computational modeling; Markov processes; Feature extraction; Deep learning; human action recognition; image encoding; mutimodal fusion

向作者/读者索取更多资源

Convolutional Neural Networks (CNNs) are used to recognize human actions using inertial sensor data in this study, where four spatial domain methods are employed to transform the sensor data into activity images. The multimodal fusion framework improves the accuracy by convolving each activity image with two spatial filters and fusing the deep features learned by CNN model ResNet-18. The proposed method demonstrates superiority over the current state-of-the-art in human action recognition on inertial datasets.
Convolutional Neural Networks (CNNs) are successful deep learning models in the field of computer vision. To get the maximum advantage of CNN model for Human Action Recognition (HAR) using inertial sensor data, in this paper, we use four types of spatial domain methods for transforming inertial sensor data to activity images, which are then utilized in a novel fusion framework. These four types of activity images are Signal Images (SI), Gramian Angular Field (GAF) Images, Markov Transition Field (MTF) Images and Recurrence Plot (RP) Images. Furthermore, for creating a multimodal fusion framework and to exploit activity images, we made each type of activity images multimodal by convolving with two spatial domain filters: Prewitt filter and High-boost filter. ResNet-18, a CNN model, is used to learn deep features from multi-modalities. Learned features are extracted from the last pooling layer of each ResNet and then fused by canonical correlation based fusion (CCF) for improving the accuracy of human action recognition. These highly informative features are served as input to a multi-class Support Vector Machine (SVM). Experimental results on three publicly available inertial datasets show the superiority of the proposed method over the current state-of-the-art.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据