4.7 Article

Temporal Pyramid Network With Spatial-Temporal Attention for Pedestrian Trajectory Prediction

期刊

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TNSE.2021.3065019

关键词

Trajectory; Prediction algorithms; Feature extraction; Predictive models; Computational modeling; Task analysis; Modulation; Deep learning; social behavior; social computing; social interactions; spatial-temporal attention; temporal pyramid network; trajectory prediction

资金

  1. Natural Science Foundation of China [62001304, 61871273, 61971476]
  2. Guangdong Basic, and Applied Basic Research Foundation [2019A1515110410]
  3. Macau Science, and Technology Development Fund [SKL-IOTSC-2018-2020, 077/2018/A2, 0060/2019/A1]
  4. Research Committee at the University of Macau [MYRG2018-00029-FST, MYRG2019-00023-FST]

向作者/读者索取更多资源

This paper proposes a novel method for pedestrian trajectory prediction, using a temporal pyramid network and attention mechanism to effectively model and predict complex social interactions. Experimental results demonstrate the superiority of this method.
Understanding and predicting human motion behavior with social interactions have become an increasingly crucial problem for a vast number of applications, ranging from visual navigation of autonomous vehicles to activity prediction of intelligent video surveillance. Accurately forecasting crowd motion behavior is challenging due to the multimodal nature of trajectories and complex social interactions between humans. Recent algorithms model and predict the trajectory with a single resolution, making them difficult to exploit the long-range information and the short-range information of the motion behavior simultaneously. In this paper, we propose a temporal pyramid network for pedestrian trajectory prediction through a squeeze modulation and a dilation modulation. The hierarchical design of our framework allows to model the trajectory with multi-resolution, then can better capture the motion behavior at various tempos. By progressively combining the global context with the local one, we finally construct a coarse-to-fine hierarchical pedestrian trajectory prediction framework with multi-supervision. Further, we introduce a unified spatial-temporal attention mechanism to adaptively select important information of persons around in both spatial and temporal domains. We show that our attention strategy is intuitive and effective to encode the influence of social interactions. Experimental results on two benchmarks demonstrate the superiority of our proposed scheme.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据