4.7 Article

Temporal Shift and Spatial Attention-Based Two-Stream Network for Traffic Risk Assessment

Journal

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS
Volume 23, Issue 8, Pages 12518-12530

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TITS.2021.3115123

Keywords

Videos; Risk management; Feature extraction; Complexity theory; Accidents; Cameras; Vehicles; Traffic risk assessment; two-stream network; weighted temporal shift module; spatial and channel attention mechanism

Funding

  1. National Nature Science Foundation of China [62176138, 62176136]
  2. Shandong Provincial Key Research and Development Program [2019JZZY010130, 2020CXGC010207]
  3. National Key Research and Development Program of China [2018YFB1305300]

Ask authors/readers for more resources

This study introduces a novel Temporal Shift and Spatial Attention based Two-stream Network (TSSAT-Net) for on-board vision based traffic risk assessment, incorporating new judgement measures integrating actual driving experience and scenario complexity. A weighted Temporal Shift Module is proposed to extract spatial-temporal features effectively, while a spatial and channel attention mechanism is used to focus on features closely related to traffic risks and avoid redundant information.
On-board vision based traffic risk assessment is a challenging task for intelligent driving systems, which has some special challenges including spatial-temporal feature extraction, different judgements of risks, real-time requirement, lacking data, etc. To overcome these difficulties, we propose a novel Temporal Shift and Spatial Attention based Two-stream Network (TSSAT-Net) for on-board vision based traffic risk assessment. Firstly, we build new judgement measures that integrate actual driving experience and scenario complexity, and release an on-board vision based traffic risk assessment dataset. Secondly, a novel weighted Temporal Shift Module (weighted-TSM) based two-stream network is proposed; unlike previous methods that rely on complex and time-consuming 3D CNN or LSTM calculation, the proposed two-stream network can effectively extract spatial-temporal features using a weighted temporal shift mechanism with just 2D CNN calculation requirement. Thirdly, a spatial and channel attention mechanism is proposed to make the TSSAT-Net more focus on the features closely related to traffic risks, avoiding redundant information in complex traffic scenarios. Experiments based on the released comprehensive dataset show that our method achieves the state-of-the-art classification accuracy in real-time.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available