☆ 4.6 Article

Predicting Pedestrian Crossing Intention With Feature Fusion and Spatio-Temporal Attention

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES (2022)

期刊

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES

卷 7, 期 2, 页码 221-230

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TIV.2022.3162719

关键词

Pedestrian intention; autonomous driving; spatial-temporal fusion

类别

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic Transportation Science & Technology

资金

United States Department of Transportation [69A3551747111]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This study presents a novel neural network architecture to predict pedestrian crossing intention by fusing different spatio-temporal features. By optimally combining various phenomena such as RGB imagery sequences, semantic segmentation masks, and ego-vehicle speed, the proposed method achieves state-of-the-art performance.

Predicting vulnerable road user behavior is an essential prerequisite for deploying Automated Driving Systems (ADS) in the real-world. Pedestrian crossing intention should be recognized in real-time, especially for urban driving. Recent works have shown the potential of using vision-based deep neural network models for this task. However, these models are not robust and certain issues still need to be resolved. First, the global spatio-temporal context that accounts for the interaction between the target pedestrian and the scene has not been properly utilized. Second, the optimal strategy for fusing different sensor data has not been thoroughly investigated. This work addresses the above limitations by introducing a novel neural network architecture to fuse inherently different spatio-temporal features for pedestrian crossing intention prediction. We fuse different phenomena such as sequences of RGB imagery, semantic segmentation masks, and ego-vehicle speed in an optimal way using attention mechanisms and a stack of recurrent neural networks. The optimal architecture was obtained through exhaustive ablation and comparison studies. Extensive comparative experiments on the JAAD and PIE pedestrian action prediction benchmarks demonstrate the effectiveness of the proposed method, where state-of-the-art performance was achieved. Our code is open-source and publicly available: https://github.com/OSU-Haolin/Pedestrian_Crossing_Intention_Prediction.

Predicting Pedestrian Crossing Intention With Feature Fusion and Spatio-Temporal Attention

期刊

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Predicting Pedestrian Crossing Intention With Feature Fusion and Spatio-Temporal Attention

期刊

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文