Article

A deep learning approach for imbalanced crash data in predicting highway-rail grade crossings accidents

Journal

RELIABILITY ENGINEERING & SYSTEM SAFETY
Volume 216, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.ress.2021.108019

Keywords

Deep learning; Machine learning; Safety; Prediction accuracy; Imbalanced data; Highway rail grade crossing

Funding

  1. North Dakota State University
  2. Mountain-Plains Consortium (MPC), a university transportation center - U.S. Department of Transportation

The study explores deep learning for predicting accidents at highway-rail grade crossings and finds that it outperforms traditional machine learning methods, especially on imbalanced data. It applies a resampling technique to address the class imbalance and compares the models' prediction performance using several evaluation metrics.
Accurate accident prediction for highway-rail grade crossings (HRGCs) is critically important for supporting at-grade safety improvement decisions. Numerous machine learning methods have been developed to predict accidents and to identify contributing physical and operational characteristics. A more advanced deep learning-based model is explored here as a more accurate means of predicting HRGC crashes than machine learning-based approaches. In particular, the prediction performance of a convolutional neural network (CNN) model is compared with that of the most commonly used machine learning methods, such as decision trees (DT) and random forests (RF). A 19-year HRGC dataset from North Dakota (ND) is used in this study. Training a machine learning model on imbalanced data (e.g., an unequal distribution of labeled data between the accident and no-accident classes) introduces unique challenges for accurate prediction, especially for the minority class. In this paper, a resampling approach is used to address the imbalanced data issue. Various performance measures are used to compare the models' prediction performance. The results indicate that resampling the imbalanced dataset significantly improves the recall rate. The results also show that the proposed deep learning-based approach, which deepens the layer levels and adapts to the training dataset, achieves better prediction performance than the other machine learning-based methods.
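
The abstract does not say which resampling method was used, so the following is only a minimal sketch, assuming simple random oversampling of the minority (accident) class. It shows how a dataset might be balanced and how recall for the accident class could be computed. The function names `random_oversample` and `recall`, the toy data, and the label convention (1 = accident, 0 = no accident) are all illustrative assumptions, not details taken from the paper.

```python
import random
from collections import Counter


def random_oversample(rows, labels, seed=0):
    """Duplicate randomly chosen rows of each smaller class until every
    class has as many rows as the largest class.

    Assumption: plain random oversampling; the paper's actual resampling
    method is not specified in the abstract.
    """
    rng = random.Random(seed)
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)

    target = max(len(group) for group in by_class.values())
    out_rows, out_labels = [], []
    for label, group in by_class.items():
        # Draw extra copies (with replacement) to reach the target size.
        extra = [rng.choice(group) for _ in range(target - len(group))]
        for row in group + extra:
            out_rows.append(row)
            out_labels.append(label)
    return out_rows, out_labels


def recall(y_true, y_pred, positive=1):
    """Recall = TP / (TP + FN) for the chosen positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    return tp / (tp + fn) if (tp + fn) else 0.0


# Hypothetical toy data: 8 no-accident crossings (0) and 2 accident crossings (1).
rows = [[i] for i in range(10)]
labels = [0] * 8 + [1] * 2

balanced_rows, balanced_labels = random_oversample(rows, labels)
class_counts = Counter(balanced_labels)
```

In practice, resampling is normally applied to the training split only, so that duplicated minority rows cannot leak into the data used for evaluation.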
