4.7 Article

HSTA: A Hierarchical Spatio-Temporal Attention Model for Trajectory Prediction

期刊

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY
卷 70, 期 11, 页码 11295-11307

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TVT.2021.3115018

关键词

Trajectory; Predictive models; Computational modeling; Logic gates; Correlation; Training; Decoding; Trajectory prediction; autonomous driving; spatio-temporal modeling

资金

  1. Shanghai Rising Star Program [21QC1400900]
  2. Anhui Provincial Natural Science Foundation, Anhui Energy-Internet Joint Program [2008085UD01]
  3. National Natural Science Foundation of China [U1913601, 61906138]
  4. European Union [945539]

向作者/读者索取更多资源

In this paper, a Hierarchical Spatio-Temporal Attention architecture (HSTA) is proposed to capture spatial interactions using graph attention mechanism (GAT), encode temporal correlations with multi-head attention mechanism (MHA), and integrate spatial and temporal interactions with a state gated fusion (SGF) layer. The experimental results demonstrate that the proposed method outperforms baselines on pedestrian and vehicle datasets, achieving state-of-the-art achievements.
Predicting the future trajectories of surrounding agents has become an crucial problem to be solved for the safety of autonomous vehicles. Recent studies based on Long Short Term Memory (LSTM) networks have shown powerful abilities to model social interactions. However, many of these approaches focus on spatial interactions of the neighborhood agents but ignore temporal interactions that accompany spatial interactions. In this paper, we propose a Hierarchical Spatio-Temporal Attention architecture (HSTA), which activates the utilization of spatial interactions with different weights, and jointly considers the temporal interactions across time steps of all agents. More specially, the graph attention mechanism (GAT) is presented to capture spatial interactions, the multi-head attention mechanism (MHA) is conducted to encode temporal correlations of interactions and a state gated fusion (SGF) layer is used to integrate spatial and temporal interactions. We evaluate our proposed method against baselines on both pedestrian and vehicle datasets. The results show that our model is effective and achieves state-of-the-art achievements.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据