4.6 Article

Effects of Data Characteristics on Bus Travel Time Prediction: A Systematic Study

Journal

SUSTAINABILITY
Volume 15, Issue 6, Pages -

Publisher

MDPI
DOI: 10.3390/su15064731

Keywords

travel time data analysis; bus travel time; clustering; prediction; machine learning techniques

Ask authors/readers for more resources

The accuracy of predicting bus travel time is an important step in improving the quality of public transportation. Previous studies have used chronological factors to identify significant regressors for predicting travel times. However, travel time patterns can vary depending on time and location. This study systematically analyzes the impact of different ways of presenting input data to the prediction algorithm. The results show that grouping the dataset and training separate models on them, particularly considering data-derived clusters, can significantly improve prediction accuracy.
The prediction of bus travel time with accuracy is a significant step toward improving the quality of public transportation. Drawing meaningful inferences from the data and using these to aid in prediction tasks is always an area of interest. Earlier studies predicted bus travel times by identifying significant regressors, which were identified based on chronological factors. However, travel time patterns may vary depending on time and location. A related question is whether the prediction accuracy can be improved with the choice of input variables. The present study analyzes this question systematically by presenting the input data in different ways to the prediction algorithm. The prediction accuracy increased when the dataset was grouped, and separate models were trained on them, the highest accurate case being the one where the data-derived clusters were considered. This demonstrates that understanding patterns and groups within the dataset helps in improving prediction accuracy.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available