Article

Outlier-eliminated k-means clustering algorithm based on differential privacy preservation

Journal

APPLIED INTELLIGENCE
Volume 45, Issue 4, Pages 1179-1191

Publisher

SPRINGER
DOI: 10.1007/s10489-016-0813-z

Keywords

Differential privacy (DP) preservation; k-means clustering; Outlier; OEDP

Funding

  1. National Natural Science Foundation of China [61370050]
  2. Natural Science Foundation of Anhui Province [1508085QF134]


Individual privacy may be compromised during the process of mining for valuable information, and the potential for data mining is hindered by the need to preserve privacy. It is well known that k-means clustering algorithms based on differential privacy require preserving privacy while maintaining the availability of clustering. However, it is difficult to balance both aspects in traditional algorithms. In this paper, an outlier-eliminated differential privacy (OEDP) k-means algorithm is proposed that both preserves privacy and improves clustering efficiency. The proposed approach selects the initial centre points in accordance with the distribution density of data points, and adds Laplacian noise to the original data for privacy preservation. Both a theoretical analysis and comparative experiments were conducted. The theoretical analysis shows that the proposed algorithm satisfies epsilon-differential privacy. Furthermore, the experimental results show that, compared to other methods, the proposed algorithm effectively preserves data privacy and improves the clustering results in terms of accuracy, stability, and availability.
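The abstract names three ingredients: outlier elimination, density-based selection of initial centres, and Laplacian noise. The following is a minimal Python sketch of how such pieces could fit together; it is not the paper's OEDP algorithm. Everything in it is an assumption made for illustration: the function names, the `radius` and `min_neighbours` parameters, the neighbour-count density measure, the greedy seeding rule, and the choice to perturb each cluster's sum and count (a common construction in the differentially private k-means literature, which may differ from where the paper injects noise). The noise scale `(dim + 1) / eps_per_iter` assumes data scaled to [0, 1] in every dimension. Because the outlier filter and the seeding step read the raw data, the sketch makes no end-to-end privacy claim.

```python
import math
import random


def laplace(scale, rng):
    """Sample Laplace(0, scale) as the difference of two exponentials."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def neighbour_counts(points, radius):
    """Density proxy (assumed): number of other points within `radius`."""
    return [
        sum(1 for j, q in enumerate(points) if j != i and math.dist(p, q) <= radius)
        for i, p in enumerate(points)
    ]


def drop_outliers(points, radius, min_neighbours):
    """Keep only points with at least `min_neighbours` close neighbours."""
    counts = neighbour_counts(points, radius)
    return [p for p, c in zip(points, counts) if c >= min_neighbours]


def density_seeds(points, k, radius):
    """Greedily take the densest points that lie more than `radius` apart.

    May return fewer than k seeds if the data has fewer dense regions.
    """
    counts = neighbour_counts(points, radius)
    order = sorted(range(len(points)), key=lambda i: -counts[i])
    seeds = []
    for i in order:
        if all(math.dist(points[i], s) > radius for s in seeds):
            seeds.append(points[i])
        if len(seeds) == k:
            break
    return seeds


def noisy_kmeans(points, seeds, eps_per_iter, iters, rng):
    """Lloyd iterations with Laplace noise on each cluster's sum and count."""
    dim = len(points[0])
    scale = (dim + 1) / eps_per_iter  # assumes every coordinate lies in [0, 1]
    centres = [tuple(s) for s in seeds]
    for _ in range(iters):
        sums = [[0.0] * dim for _ in centres]
        counts = [0.0] * len(centres)
        for p in points:
            # Assign the point to its nearest current centre.
            c = min(range(len(centres)), key=lambda j: math.dist(p, centres[j]))
            counts[c] += 1
            for d in range(dim):
                sums[c][d] += p[d]
        new_centres = []
        for c, old in enumerate(centres):
            n = counts[c] + laplace(scale, rng)
            if n < 1.0:  # noisy count unusable: keep the previous centre
                new_centres.append(old)
                continue
            new_centres.append(
                tuple((sums[c][d] + laplace(scale, rng)) / n for d in range(dim))
            )
        centres = new_centres
    return centres


# Toy data (made up): two tight groups plus one isolated point.
data = [(0.10, 0.10), (0.12, 0.11), (0.09, 0.13), (0.11, 0.08),
        (0.80, 0.82), (0.83, 0.79), (0.81, 0.84), (0.78, 0.80),
        (0.50, 0.99)]
clean = drop_outliers(data, radius=0.1, min_neighbours=2)
seeds = density_seeds(clean, k=2, radius=0.1)
centres = noisy_kmeans(clean, seeds, eps_per_iter=1.0, iters=3,
                       rng=random.Random(0))
```

With a small `eps_per_iter` the noise dominates on a data set this tiny, so the returned centres can drift far from the true group means; the trade-off between privacy and clustering availability that the abstract describes shows up directly in that parameter.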
