4.5 Article

Predicting the travel mode choice with interpretable machine learning techniques: A comparative study

Journal

TRAVEL BEHAVIOUR AND SOCIETY
Volume 29, Issue -, Pages 279-296

Publisher

ELSEVIER
DOI: 10.1016/j.tbs.2022.07.003

Keywords

Travel behavior; Travel mode choice; Machine learning; Light Gradient Boosting model; SHAP analysis; Feature importance

Categories

Funding

  1. Imam Abdulrahman Bin Faisal University

Ask authors/readers for more resources

This study proposes a systematic machine learning framework to understand travelers' mode choice decisions. Five ML models were developed and evaluated using Dutch National Travel Survey data, with LightGBDT outperforming other models. Variable importance and SHAP dependency analysis highlighted the factors that significantly influence mode choice.
Prediction of mode choice for travelers has been the subject of keen interest among transportation planners. Traditionally, mode choice analysis is conducted by statistical models or simple machine learning (ML) paradigms. Although statistical analysis approaches have a good theoretical basis and interpretability, they are built on several unrealistic assumptions regarding the distribution of data, which may lead to biased model predictions. On the other hand, the ML methods widely used in this regard have poor interpretability and fail to capture the behavioral aspects. To fill this gap, this study proposes a systematic machine learning (ML) framework for a better understanding of traveler's mode choice decisions. Five different ML models (Logistic Regression, Random Forests, Decision Tree, Multilayer Perceptron, Light Gradient Boosting Decision Tree (LightGBDT)) were developed to model the travel mode choices of travelers using three years of Dutch National Travel Survey data. Empirical results of various performance evaluation metrics (overall accuracy, average precision, precision-recall curves) showed that LightGBDT outperformed other models for both under and oversampling strategies. To overcome the blackbox criticism of ML models and to improve their interpretability, variable importance and SHAP dependency analysis were also conducted. The analysis showed that predictors that significantly influence the travel mode decisions of travelers include trip distance, travelers' age and annual income, number of cars/bicycles owned, and trip density. The results can be used for better understanding and effective modeling of travelers' mode choice preferences.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available