4.6 Article

Using traffic flow characteristics to predict real-time conflict risk: A novel method for trajectory data analysis

Journal

ANALYTIC METHODS IN ACCIDENT RESEARCH
Volume 35, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.amar.2022.100217

Keywords

Real-time conflict risk; Heterogeneity; Random parameter; Machine learning

Funding

  1. National Natural Science Foundation of China [71971222, 71901223]
  2. Natural Science Foundation of Hunan Province [2021JJ40746]
  3. Fundamental Research Funds for the Central Universities of Central South University [2021zzts0172]

Ask authors/readers for more resources

This study explores the relationship between conflicts and traffic flow features, and develops real-time predictive models. The results indicate that traffic flow characteristics have significant impacts on the probability of conflict occurrence, and resampling techniques can improve the model performance.
The real-time conflict prediction model using traffic flow characteristics is much less studied than the crash-based model. This study aims at exploring the relationship between conflicts and traffic flow features with the consideration of heterogeneity and developing predictive models to identify conflict-prone conditions in a real-time manner. The high resolution trajectory data from the HighD dataset is used as empirical data. A novel method with the virtual detector approach for traffic feature extraction and a two-step framework is proposed for the trajectory data analysis. The framework consists of an exploratory study by random parameter logit model with heterogeneity in means and variances and a comparative study on several machine learning methods, including eXtreme Gradient Boosting (Boosting), Random Forest (Bagging), Support Vector Machine (Single-classifier), and Multilayer-Perceptron (Deep neural network). Results indicate that (1) traffic flow characteristics have significant impacts on the probability of conflict occurrence; (2) the statistical model considering mean heterogeneity outperforms the counterpart and lane differences variables are found to significantly impact the means of random parameters for both lane variables and lane differences variables; (3) eXtreme Gradient Boosting trained on an under-sampled dataset turns out to be the best model with the highest AUC of 0.871 and precision of 0.867, showing that re-sampling techniques can significantly improve the model performance. The proposed model is found to be sensitive to the conflict threshold. Sensitivity analysis on feature selection further confirms that the conflict risk prediction should consider both subject lane features and lane difference features, which verifies the consistency with exploratory analysis based on the statistical model. The consistency between statistical models and machine learning methods improves the interpretability of results for the latter one. (c) 2022 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available