4.7 Article

Predicting the probability distribution of bus travel time to measure the reliability of public transport services

Journal

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.trc.2022.103619

Keywords

Long-term prediction; Travel time; Probabilistic model; Public transport; Reliability

Funding

  1. GIRO Inc.
  2. Natural Sciences and Engineering Research Council of Canada [CRDPJ 520349-17]

Ask authors/readers for more resources

This study compares two types of probabilistic models and finds that a similarity-based density estimation model and a smoothed logistic regression model for probabilistic classification perform best in predicting the conditional probability density function of travel time. These models can improve the accuracy of predicting the reliability of public transport services.
An important aspect of the quality of a public transport service is its reliability, which is defined as the invariability of the service attributes. In order to measure reliability during the service planning phase, a key piece of information is the long-term prediction of the density of the travel time, which conveys the uncertainty of travel times. This work empirically compares probabilistic models for the prediction of the conditional probability density function (PDF) of the travel time and proposes a simulation framework taking as input the latter distributions to approximate the expected secondary delays, a measure of the reliability of public transport services. Two types of probabilistic models, namely similarity-based density estimation models and a smoothed logistic regression for probabilistic classification model, are compared on a dataset of more than 41,000 trips and 50 bus routes of the city of Montreal. A similarity-based density estimation model using a k-nearest neighbors method and a log-logistic distribution predicted the best estimate of the true conditional PDF of the travel time and generated the most accurate approximations of the expected secondary delays on this dataset. This model reduced the mean squared error of the expected secondary delay by approximately 9% compared to the benchmark model, namely a random forest. This result highlights the added value of modeling the conditional PDF of the travel time with probabilistic models.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available