☆ 4.6 Article

Magnitude-Orientation Stream network and depth information applied to activity recognition

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION (2019)

期刊

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION

卷 63, 期 -, 页码 -

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE

DOI: 10.1016/j.jvcir.2019.102596

关键词

Activity recognition; Convolutional neural networks (CNNs); Two-stream convolutional networks; Spatiotemporal information; Optical flow; Depth information

类别

Computer Science, Information Systems Computer Science, Software Engineering

资金

Brazilian National Research Council - CNPq [311053/2016-5, 204952/2017-4, 438629/2018-3]
Minas Gerais Research Foundation FAPEMIG [APQ-00567-14, PPM-00540-17]
Coordination for the Improvement of Higher Education Personnel - CAPES (DeepEyes Project)
NVIDIA Corporation

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

The temporal component of videos provides an important clue for activity recognition, as a number of activities can be reliably recognized based on the motion information. In view of that, this work proposes a novel temporal stream for two-stream convolutional networks based on images computed from the optical flow magnitude and orientation, named Magnitude-Orientation Stream (MOS), to learn the motion in a better and richer manner. Our method applies simple non-linear transformations on the vertical and horizontal components of the optical flow to generate input images for the temporal stream. Moreover, we also employ depth information to use as a weighting scheme on the magnitude information to compensate the distance of the subjects performing the activity to the camera. Experimental results, carried on two well-known datasets (UCF101 and NTU), demonstrate that using our proposed temporal stream as input to existing neural network architectures can improve their performance for activity recognition. Results demonstrate that our temporal stream provides complementary information able to improve the classical two-stream methods, indicating the suitability of our approach to be used as a temporal video representation. (C) 2019 Elsevier Inc. All rights reserved.

Magnitude-Orientation Stream network and depth information applied to activity recognition

期刊

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Magnitude-Orientation Stream network and depth information applied to activity recognition

期刊

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文