4.7 Article

Body part relation reasoning network for human activity understanding

Journal

INFORMATION SCIENCES
Volume 619, Issue -, Pages 526-539

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2022.11.012

Keywords

Activity understanding; Convolutional neural network; Relation reasoning; Action recognition; Deep learning

Ask authors/readers for more resources

The wide application of vision sensors has created both opportunities and challenges for intelligent understanding of image and video data. Efficient processing and utilization of this data necessitates research on effective algorithms for human activity understanding. Existing methods for human activity understanding suffer from issues such as low recognition accuracy, high complexity, and poor robustness. In response to these problems, the body part relation reasoning network (BPRRN) is constructed to leverage common-sense knowledge within human bodies for reasoning human activities. Experimental results on Stanford 40 and UCF101 datasets demonstrate the effectiveness of the proposed network.
The wide application of vision sensors has brought great opportunities and challenges to the intelligent understanding of image and video data. Research on effective human activ-ity understanding algorithms is of great significance in efficient processing and utilizing the data. At present, existing human activity understanding methods have the defects of low recognition accuracy, high complexity or poor robustness. In view of the above prob-lems, we construct the body part relation reasoning network (BPRRN). It aims to explore and utilize the common-sense knowledge in human bodies to reason human activities. Firstly, ten body part boxes are generated from each human body with the help of pose estimation and part classification functions. Secondly, with the states of each body part, the relationships between different body parts are explored and utilized for constructing the body part relation reasoning module (BPRRM). Finally, the body part relation reasoning features from each frame are adopted for temporal modeling with the temporal relation reasoning module (TRRM). In this way, the context association between video frames are constructed, which further promotes the reasoning ability of BPRRN. The experiments car-ried out on Stanford 40 and UCF101 demonstrate the effectiveness of the proposed network.(c) 2022 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available