4.7 Article

Towards the Inference of Travel Purpose with Heterogeneous Urban Data

Journal

IEEE TRANSACTIONS ON BIG DATA
Volume 8, Issue 1, Pages 166-177

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TBDATA.2019.2921823

Keywords

Dynamic Bayesian network; trajectory; social media; point of interest; trip purpose

Funding

  1. US National Science Foundation [IIS-1553411, CNS-1737590]
  2. Region II University Transportation Center
  3. Transportation Informatics (TransInfo) University Transportation Center at University at Buffalo

Ask authors/readers for more resources

In this paper, the authors propose a method to infer individuals' trip purposes by combining knowledge from various data sources, including trajectories, POIs, and social media data. The proposed Dynamic Bayesian Network model captures the sequential properties of trip activities and the popularity of trip end areas. Additionally, an efficient method is proposed to identify POIs from social media messages and learn their popularities. To address the data imbalance problem, an ensemble DBN method with sampling technique is proposed. Furthermore, an incremental DBN method is proposed to handle the continuous collection of trip data. Experimental results demonstrate the advantages of the proposed method in correctly inferring trip purposes.
In people's daily lives, travel takes up an important part, and many trips are generated everyday, such as going to school or shopping. With the widely adoption of GPS-integrated devices, a large amount of trips can be recorded with GPS trajectories. These trajectories are represented by sequences of geo-coordinates and can help us answer simple questions such as where did you go. However, there is another important question awaiting to be answered, that is what did/will you do, i.e., the trip purpose inference. In practice, people's trip purposes are very important in understanding travel behaviors and estimating travel demands. Obviously, it is very challenging to infer trip purposes solely based on the trajectories, because the GPS devices are not accurate enough to pinpoint the venues visited. In this paper, we infer individual's trip purposes by combining the knowledge from heterogeneous data sources including trajectories, POls and social media data. The proposed Dynamic Bayesian Network model (DBN) captures three important factors: the sequential properties of trip activities, the functionality and POI popularity of trip end areas. In addition, we propose an efficient method with local candidate pools to identify POIs from geo-tagged social media messages, and learn the POI popularities from nearby social media data. Moreover, trip data is usually imbalanced across different activities. This data imbalance problem can cause serious challenges because the DBN model could be biased by those popular class labels. Considering this challenge, we propose an ensemble DBN method with sampling technique (eDBN) which results in more accurate inference. Furthermore, real-world trip data are continuously collected on a daily basis. The batch model would result in unnecessary computation because historical data need to be revisited. We handle this problem by proposing an incremental DBN method (iDBN) which is both effective and efficient. Extensive experiments are conducted on real-world data sets with trajectories of 8,361 residents and the 6.9 million geo-tagged tweets in the Bay area. Experimental results demonstrate the advantages of the proposed method on correctly inferring the trip purposes.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available