4.6 Article

Machine Learning-Based Ensemble Classifiers for Anomaly Handling in Smart Home Energy Consumption Data

期刊

SENSORS
卷 22, 期 23, 页码 -

出版社

MDPI
DOI: 10.3390/s22239323

关键词

classification; data anomalies; data imputation; energy consumption data; ensemble classifiers; machine learning; smart home data; smart meter data; tracebase dataset

向作者/读者索取更多资源

This paper proposes an ML-based ensemble classifiers approach to address anomalies in smart home energy consumption data. By identifying and removing anomalies, and imputing missing information, more accurate data analysis is achieved. The study finds that the ensemble classifier RF+SVM+DT performs superior in anomaly handling.
Addressing data anomalies (e.g., garbage data, outliers, redundant data, and missing data) plays a vital role in performing accurate analytics (billing, forecasting, load profiling, etc.) on smart homes' energy consumption data. From the literature, it has been identified that the data imputation with machine learning (ML)-based single-classifier approaches are used to address data quality issues. However, these approaches are not effective to address the hidden issues of smart home energy consumption data due to the presence of a variety of anomalies. Hence, this paper proposes ML-based ensemble classifiers using random forest (RF), support vector machine (SVM), decision tree (DT), naive Bayes, K-nearest neighbor, and neural networks to handle all the possible anomalies in smart home energy consumption data. The proposed approach initially identifies all anomalies and removes them, and then imputes this removed/missing information. The entire implementation consists of four parts. Part 1 presents anomaly detection and removal, part 2 presents data imputation, part 3 presents single-classifier approaches, and part 4 presents ensemble classifiers approaches. To assess the classifiers' performance, various metrics, namely, accuracy, precision, recall/sensitivity, specificity, and F1 score are computed. From these metrics, it is identified that the ensemble classifier RF+SVM+DT has shown superior performance over the conventional single classifiers as well the other ensemble classifiers for anomaly handling.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据