4.6 Article

Magnitude-Orientation Stream network and depth information applied to activity recognition

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.jvcir.2019.102596

关键词

Activity recognition; Convolutional neural networks (CNNs); Two-stream convolutional networks; Spatiotemporal information; Optical flow; Depth information

资金

  1. Brazilian National Research Council - CNPq [311053/2016-5, 204952/2017-4, 438629/2018-3]
  2. Minas Gerais Research Foundation FAPEMIG [APQ-00567-14, PPM-00540-17]
  3. Coordination for the Improvement of Higher Education Personnel - CAPES (DeepEyes Project)
  4. NVIDIA Corporation

向作者/读者索取更多资源

The temporal component of videos provides an important clue for activity recognition, as a number of activities can be reliably recognized based on the motion information. In view of that, this work proposes a novel temporal stream for two-stream convolutional networks based on images computed from the optical flow magnitude and orientation, named Magnitude-Orientation Stream (MOS), to learn the motion in a better and richer manner. Our method applies simple non-linear transformations on the vertical and horizontal components of the optical flow to generate input images for the temporal stream. Moreover, we also employ depth information to use as a weighting scheme on the magnitude information to compensate the distance of the subjects performing the activity to the camera. Experimental results, carried on two well-known datasets (UCF101 and NTU), demonstrate that using our proposed temporal stream as input to existing neural network architectures can improve their performance for activity recognition. Results demonstrate that our temporal stream provides complementary information able to improve the classical two-stream methods, indicating the suitability of our approach to be used as a temporal video representation. (C) 2019 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据