4.6 Article

Incremental density-based ensemble clustering over evolving data streams

期刊

NEUROCOMPUTING
卷 191, 期 -, 页码 34-43

出版社

ELSEVIER
DOI: 10.1016/j.neucom.2016.01.009

关键词

Ensemble clustering; Data streams; Density-based clustering; Smart grid

资金

  1. National Natural Science Foundation of China [61473194]
  2. Science and Technology Planning Project of Guangdong Province of China [2013B091300019]
  3. Peacock Plan [KOCX201208161601439]

向作者/读者索取更多资源

The recent advances in smart meter technology have enabled for collecting information about customer power consumption in real time. The measurements are generated continuously and in some cases, e.g. in the industrial smart metering the data exchange rates are highly-fluctuating. The storage, querying, and mining of such smart meter streaming data with a large number of missing and sparse values are highly computationally challenging tasks. To address such matters, we propose a new method called incremental density-based ensemble clustering (IDEStream) for incremental segmentation of various kinds of factories based on their electricity consumption data. It exploits a gamma mixture model to suppress the influence of sparse data units in the data streams that sequentially arrive within a time window and then generates a clustering from the processed data of that window. IDEStream uses a unique incremental ensemble approach to incrementally aggregate the clusterings of subsequent time windows. Experimental results on data streams collected by smart meters from manufacturing factories in Guangdong province of China have shown that the proposed algorithm outperforms several state-ofthe-art data stream clustering algorithms. The obtained segmentation can find numerous applications, an exemplar one being to define customer rates in a flexible way. (C) 2016 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据