Article

A classification of public transit users with smart card data based on time series distance metrics and a hierarchical clustering method

Journal

TRANSPORTMETRICA A-TRANSPORT SCIENCE
Volume 16, Issue 1, Pages 56-75

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1080/23249935.2018.1479722

Keywords

Public transportation data; smart card users' behavior; time-series classification; cross-correlation; dynamic time warping

Funding

  1. Thales Group
  2. Natural Sciences and Engineering Research Council of Canada (NSERC) [RDCPJ 446107-12]

Classifying the behavior of smart card users is important in public transit demand analysis because it reveals people's sequence of activities over a period of time. However, classical metrics such as Euclidean distance are not appropriate for time-series classification. To address this problem, this article presents a method for classifying public transit smart card users' daily transactions, which are represented as time series. The approach uses cross-correlation distance (CCD), hierarchical clustering, and subgroups by metric parameter to understand users' temporal patterns. The clustering results are compared with those obtained using dynamic time warping (DTW) distance, a common measure of time-series distance. After a brief pedagogical example explaining the DTW and CCD concepts, a program is developed in R to validate the method on a real dataset of smart card transactions. The dataset covers the use of the public transit system in the city of Gatineau in September 2013. The results demonstrate that CCD performs better than DTW in classifying the time series, and that the classification method identifies different daily behaviors among public transit users. The results will help transit authorities offer better services to diverse groups of smart card users.
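
The two distances named in the abstract can be illustrated with a minimal sketch. This is not the authors' R program. It is written in Python, and the hourly 0/1 tap profiles, the function names, the `max_lag` parameter, and the definition of CCD as one minus the best normalized cross-correlation within `max_lag` hours are all assumptions made for illustration; the paper's exact formulation may differ.

```python
import math


def dtw_distance(a, b):
    """Textbook dynamic time warping with |x - y| as the local cost."""
    n, m = len(a), len(b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def cross_correlation(a, b, lag):
    """Normalized cross-correlation of two equal-length series, b shifted by lag."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    da = [x - mean_a for x in a]
    db = [x - mean_b for x in b]
    denom = math.sqrt(sum(x * x for x in da) * sum(x * x for x in db))
    if denom == 0:
        return 0.0  # a constant series carries no temporal pattern
    num = sum(da[t] * db[t + lag] for t in range(n) if 0 <= t + lag < n)
    return num / denom


def cc_distance(a, b, max_lag):
    """Assumed CCD: 1 minus the best correlation over lags in [-max_lag, max_lag]."""
    best = max(cross_correlation(a, b, lag) for lag in range(-max_lag, max_lag + 1))
    return 1.0 - best


def daily_profile(tap_hours):
    """Hypothetical 24-slot profile: 1 if the card was used in that hour, else 0."""
    return [1 if h in tap_hours else 0 for h in range(24)]


a = daily_profile({7, 17})   # commuter tapping at 07:00 and 17:00
b = daily_profile({8, 18})   # same pattern, one hour later
c = daily_profile({12})      # single midday trip
d = daily_profile({12, 22})  # same shape as a, five hours later

if __name__ == "__main__":
    for name, other in (("b", b), ("c", c), ("d", d)):
        print(name, dtw_distance(a, other), round(cc_distance(a, other, max_lag=1), 4))
```

In this toy setting, DTW assigns the same distance to the one-hour shift (`b`) and the five-hour shift (`d`), while the lag-bounded CCD keeps `b` close to `a` and pushes `d` away. A matrix of such pairwise distances could then be passed to a hierarchical clustering routine, for example `scipy.cluster.hierarchy.linkage` on its condensed form.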
