4.6 Article

Utility-Driven k-Anonymization of Public Transport User Data

Journal

IEEE ACCESS
Volume 9, Issue -, Pages 23608-23623

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2021.3055505

Keywords

Clustering; k-anonymity; privacy; tap-in tap-out transportation data; utility

Funding

  1. Singapore National Research Foundation through the Campus for Research Excellence and Technological Enterprise (CREATE) Programme

Ask authors/readers for more resources

The article introduces a k-anonymity approach that prioritizes attribute generalization based on their utility, focusing on transportation data. The approach incorporates normalization techniques, distance and utility metrics, and considers numerical and categorical attributes. Evaluation with real-world data shows the importance of considering attribute utility in generalization and how suppression of records can impact data utility unexpectedly.
In this article, we propose a k-anonymity approach that prioritizes the generalization of attributes based on their utility. We focus on transport data, which we consider a special case in which many or all attributes are quasi-identifiers (e.g., origin, destination, ride start time), as they allow correlation with easily observable auxiliary data. The novelty in our approach lies in introducing normalization techniques as well as distance and utility metrics that allow the consideration of not only numerical attributes but also categorical attributes by representing them in tree or graph form. The prioritization of the attributes in the generalization process is based on the attributes' utility and can further be influenced by either automatically or manually assigned attribute weights. We evaluate and compare different options for all components of our mechanism as well as present an extensive performance evaluation of our approach using real-world data. Lastly, we show in which cases suppression of records can counter-intuitively lead to higher data utility.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available