4.7 Article

Geographically weighted random forests for macro-level crash frequency prediction

期刊

ACCIDENT ANALYSIS AND PREVENTION
卷 194, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.aap.2023.107370

关键词

Road traffic safety; Random forest; Spatial analysis; Crash frequency prediction

向作者/读者索取更多资源

Machine learning models, such as random forests, have been widely used in the field of road safety. However, the traditional RF algorithm fails to capture spatial variability. To address this, a modified algorithm called geographically weighted random forest (GWRF) is employed. The results from analyzing London data show that GWRF outperforms RF and GWR, and is not affected by multicollinearity.
Machine learning models such as random forests (RF) have been widely applied in the field of road safety. RF is a prominent algorithm, overcoming the limitations of using a single decision tree such as overfitting and instability. However, the traditional RF is a global concept, and thus may fail to capture spatial variability. In macrolevel analysis of road safety, the relationship between crash frequency and risk factors can vary spatially. To address this issue, we employ a modified RF algorithm, named geographically weighted random forest (GWRF). Based on the data from London at the level of Middle-super-output-area (MSOA), the predictive performances of RF and GWRF are compared using mean absolute error (MAE) and root mean square error (RMSE). Moreover, considering MSOAs are geographically connected with each other, several factors related to the discrepancies between adjacent zones are also included in the models. Our results indicate that GWRF outperforms the traditional RF and GWR when an appropriate bandwidth is selected. We further explore the effects of multicollinearity on model performance. The results show that prediction accuracy of GWRF models are not susceptible to the multicollinearity. However, the importance values of those variables with multicollinearity may reduce. Finally, and of equal importance, it is found that the importance of each explanatory variable varies across zones. The density of minor road makes the highest contribution to crash frequency in downtown area, while the crash frequency in peripheral area is more sensitive to the discrepancy of road environment between MSOAs. With such information, road safety interventions can be designed and implemented according to the locally important factors, avoiding thus general guidelines addressed for the entire city.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据