4.7 Article

Inferring trip purpose by clustering sequences of smart card records

Journal

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.trc.2021.103131

Keywords

Trip purpose; Public transport; String similarity; Machine learning; Data mining; Big data

Ask authors/readers for more resources

This paper proposes a novel method to infer the trip purpose attribute from the sequences of trips of passengers in smart card transactions, showing a significant improvement in inferring trip purposes. By discovering clusters of passengers and allocating them to the closest clusters, the trip purpose of smart card transactions was successfully inferred.
Smart card transactions are known as a rich and continuous source of public transit data, but they miss some important attributes about trips and passengers. One of these missing attributes is the trip purpose attribute. This paper proposes a novel method to infer the trip purpose attribute from the sequences of trips of passengers instead of separate trips. The proposed method infers the trip purpose attribute (a missing attribute in the smart card data) from the temporal attributes (available attributes in the smart card data). First, the relation between the temporal attributes and the trip purpose attribute is learnt by discovering clusters of passengers in the Household Travel Survey dataset while each passenger is represented by one sequence of trips. Then, the discovered clusters are utilized to infer the trip purpose of smart card transactions by allocating each passenger to the closest clusters. The proposed method is implemented on the smart card and HTS datasets from southeast Queensland, Australia. The evaluation results showed a considerable improvement in inferring the trip purpose compared to the results published in the literature. Notably, the effect of considering the trip sequence was more significant than considering land use variables.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available