4.2 Article

Passenger Flow Prediction Using Smart Card Data from Connected Bus System Based on Interpretable XGBoost

期刊

出版社

WILEY-HINDAWI
DOI: 10.1155/2022/5872225

关键词

-

资金

  1. Shenzhen Science and Technology Innovation Committee [JCYJ2017 0818142947240]
  2. Foundation for Distinguished Young Talents in Higher Education of Guangdong, China [2019KQNCX126]
  3. Science and Technology Planning Project of Guangdong Province [2018B020207005]

向作者/读者索取更多资源

Bus passenger flow prediction is crucial for public traffic management, and previous studies have used machine learning models to improve prediction accuracy. However, interpretability of the influenced variables remains a challenge. This study proposes a method and implements an XGBoost model to improve prediction accuracy and interpret the results.
Bus passenger flow prediction is a critical component of advanced transportation information system for public traffic management, control, and dispatch. With the development of artificial intelligence, many previous studies attempted to apply machine learning models to extract comprehensive correlations from transit networks to improve passenger flow prediction accuracy, given that the variety and volume of traffic data have been easily obtained. The passenger flow on a station is highly affected by various factors such as the previous time step, peak hours or nonpeak hours, and extracting the key features from the data is essential for a passenger flow prediction model. Although the neural networks, k-nearest neighbor, and some deep learning models have been adopted to mine the temporal correlations of the passenger flow data, the lack of interpretability of the influenced variables is still a big problem. Classical tree-based models can mine the correlations between variables and rank the importance of each variable. In this study, we presented a method to extract passenger flow of different routes on the station and implemented a XGBoost model to find the contributions of variables to the prediction of passenger flow. Comparing to benchmark models, the proposed model can reach state-of-the-art prediction accuracy and computational efficiency on the real-world dataset. Moreover, the XGBoost model can interpret the predicted results. It can be seen that period is the most important variable for the passenger flow prediction, and so the management of buses during peak hours should be improved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据