4.6 Article

A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities

期刊

SENSORS
卷 20, 期 11, 页码 -

出版社

MDPI
DOI: 10.3390/s20113305

关键词

action recognition; weighted rank pooling; weighted dynamic image; 3D convolutional LSTM network; canonical correlation analysis

资金

  1. Chinese Scholarship Council
  2. National Natural Science Foundation of China [61379014]

向作者/读者索取更多资源

The paper presents a novel hybrid network for large-scale action recognition from multiple modalities. The network is built upon the proposed weighted dynamic images. It effectively leverages the strengths of the emerging Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) based approaches to specifically address the challenges that occur in large-scale action recognition and are not fully dealt with by the state-of-the-art methods. Specifically, the proposed hybrid network consists of a CNN based component and an RNN based component. Features extracted by the two components are fused through canonical correlation analysis and then fed to a linear Support Vector Machine (SVM) for classification. The proposed network achieved state-of-the-art results on the ChaLearn LAP IsoGD, NTU RGB+D and Multi-modal & Multi-view & Interactive ((MI)-I-2) datasets and outperformed existing methods by a large margin (over 10 percentage points in some cases).

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据