Article

Enhancing Privacy and Availability for Data Clustering in Intelligent Electrical Service of IoT

Journal

IEEE INTERNET OF THINGS JOURNAL
Volume 6, Issue 2, Pages 1530-1540

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2018.2842773

Keywords

Data clustering; differential privacy; Internet of Things (IoT); k-means algorithm; privacy protection

Funding

  1. National Natural Science Foundation of China [61502489, 61402109, 61370078, 61502102, 61502103]
  2. Chongqing Key Laboratory of Optical Communication and Networks Research Fund [KLOCN2018001]

Abstract

The ever-growing demand for electrical energy by sensing devices in the Internet of Things (IoT) generates large amounts of electricity consumption data. Electricity service providers often use wireless sensor networks to collect this consumption data for statistical analysis, so as to provide sensing devices with improved electrical services. Data clustering, an important data mining technique, excels at handling such massive data, but it carries a risk of privacy disclosure during the clustering process. To address this problem, Blum et al. proposed a differentially private k-means algorithm that effectively prevents privacy disclosure. However, the data distortion introduced by Blum et al.'s algorithm reduces the availability of the clustering results. In this paper, we propose a privacy and availability data clustering (PADC) scheme based on the k-means algorithm and differential privacy, which improves both the selection of the initial center points and the method for calculating the distance from other points to a center point. Moreover, PADC attempts to reduce the effect of outliers by detecting them during the clustering process. Security analysis indicates that our scheme satisfies differential privacy and prevents the disclosure of private information. Meanwhile, performance evaluation shows that, at the same privacy level, our scheme improves the availability of clustering results compared with existing differentially private k-means algorithms, suggesting that PADC outperforms them for intelligent electrical service in IoT.
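
To make the trade-off between privacy and availability concrete, the following is a minimal sketch of one iteration of a Laplace-noise k-means update, in the spirit of the differentially private k-means baseline the abstract refers to. It is not the PADC scheme, whose initial-center selection, distance calculation, and outlier detection are not specified in the abstract. The function names (`laplace_noise`, `nearest_center`, `dp_kmeans_step`), the normalization of every coordinate to [0, 1], and the use of a single per-iteration budget `epsilon` are all assumptions made for illustration.

```python
import random


def laplace_noise(scale, rng):
    """Draw one Laplace(0, scale) sample.

    The difference of two i.i.d. exponential variables with mean `scale`
    follows a Laplace distribution with that scale.
    """
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def nearest_center(point, centers):
    """Index of the center closest to `point` (squared Euclidean distance)."""
    return min(
        range(len(centers)),
        key=lambda j: sum((p - c) ** 2 for p, c in zip(point, centers[j])),
    )


def dp_kmeans_step(points, centers, epsilon, rng):
    """One noisy center update. Assumes every coordinate lies in [0, 1]."""
    dim = len(centers[0])
    # Assumption: adding or removing one point changes one count by 1 and
    # each of the dim coordinate sums by at most 1, so the L1 sensitivity
    # of (counts, sums) is dim + 1.
    scale = (dim + 1) / epsilon
    counts = [0.0] * len(centers)
    sums = [[0.0] * dim for _ in centers]
    for point in points:
        j = nearest_center(point, centers)
        counts[j] += 1
        for d in range(dim):
            sums[j][d] += point[d]

    new_centers = []
    for j, old in enumerate(centers):
        noisy_count = counts[j] + laplace_noise(scale, rng)
        if noisy_count < 1.0:
            # Too little (noisy) evidence for this cluster: keep the old center.
            new_centers.append(list(old))
            continue
        noisy_mean = [
            (sums[j][d] + laplace_noise(scale, rng)) / noisy_count
            for d in range(dim)
        ]
        # Clamp back into the assumed [0, 1] data domain.
        new_centers.append([min(1.0, max(0.0, v)) for v in noisy_mean])
    return new_centers
```

With a small `epsilon` the noise scale grows and the updated centers drift away from the true cluster means, which is the kind of distortion that reduces the availability of the clustering results; with a large `epsilon` the update approaches an ordinary k-means step but offers little privacy.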
