4.5 Article

Improving Multivariate Microaggregation through Hamiltonian Paths and Optimal Univariate Microaggregation

期刊

SYMMETRY-BASEL
卷 13, 期 6, 页码 -

出版社

MDPI
DOI: 10.3390/sym13060916

关键词

microaggregation; statistical disclosure control; graph theory; traveling salesman problem; data privacy; location privacy

资金

  1. European Commission under the Horizon 2020 Programme (H2020), project LOCARD [832735]
  2. Spanish Ministry of Science & Technology with project IoTrain [RTI2018-095499-B-C32]
  3. H2020 Societal Challenges Programme [832735] Funding Source: H2020 Societal Challenges Programme

向作者/读者索取更多资源

The collection of personal data is increasing rapidly, leading to privacy risks. To address this, various techniques have been proposed, with microaggregation being a popular method. This article introduces a heuristic solution inspired by the Traveling Salesman Problem and the optimal univariate microaggregation solution to tackle the multivariate microaggregation problem efficiently and effectively.
The collection of personal data is exponentially growing and, as a result, individual privacy is endangered accordingly. With the aim to lessen privacy risks whilst maintaining high degrees of data utility, a variety of techniques have been proposed, being microaggregation a very popular one. Microaggregation is a family of perturbation methods, in which its principle is to aggregate personal data records (i.e., microdata) in groups so as to preserve privacy through k-anonymity. The multivariate microaggregation problem is known to be NP-Hard; however, its univariate version could be optimally solved in polynomial time using the Hansen-Mukherjee (HM) algorithm. In this article, we propose a heuristic solution to the multivariate microaggregation problem inspired by the Traveling Salesman Problem (TSP) and the optimal univariate microaggregation solution. Given a multivariate dataset, first, we apply a TSP-tour construction heuristic to generate a Hamiltonian path through all dataset records. Next, we use the order provided by this Hamiltonian path (i.e., a given permutation of the records) as input to the Hansen-Mukherjee algorithm, virtually transforming it into a multivariate microaggregation solver we call Multivariate Hansen-Mukherjee (MHM). Our intuition is that good solutions to the TSP would yield Hamiltonian paths allowing the Hansen-Mukherjee algorithm to find good solutions to the multivariate microaggregation problem. We have tested our method with well-known benchmark datasets. Moreover, with the aim to show the usefulness of our approach to protecting location privacy, we have tested our solution with real-life trajectories datasets, too. We have compared the results of our algorithm with those of the best performing solutions, and we show that our proposal reduces the information loss resulting from the microaggregation. Overall, results suggest that transforming the multivariate microaggregation problem into its univariate counterpart by ordering microdata records with a proper Hamiltonian path and applying an optimal univariate solution leads to a reduction of the perturbation error whilst keeping the same privacy guarantees.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据