Article

Machine learning-based estimation of buildings' characteristics employing electrical and chilled water consumption data: Pipeline optimization

Journal

ENERGY AND BUILDINGS
Volume 295

Publisher

ELSEVIER SCIENCE SA
DOI: 10.1016/j.enbuild.2023.113327

Keywords

Smart meter; Commercial buildings classification; Machine learning; Feature extraction; Feature selection; Pipeline optimization

Smart meter-driven remote auditing allows rapid identification of buildings with low energy performance. This study implements and optimizes ML-based pipelines that characterize buildings from electrical and chilled-water consumption data. Results show that feature selection and pipeline optimization improve model accuracy and interpretability, and that adding features from chilled-water consumption data further raises accuracy while reducing the feature count.
Smart meter-driven remote auditing of buildings, as an alternative to labor-intensive on-site visits, permits large-scale and rapid identification of buildings with low energy performance. The existing literature has mainly focused on electricity meter data from rather small sets of buildings, and has often made little effort to facilitate the models' physical interpretability. Accordingly, the present work focuses on the implementation and optimization of ML-based pipelines that characterize buildings by use type (A), performance class (B), and operation group (C), employing hourly electrical and chilled-water consumption data. Utilizing the Building Data Genome Project II dataset (with data from 1636 buildings), feature generation, feature selection, and pipeline optimization steps are performed for each pipeline. Results demonstrate that performing the latter two steps improves the models' accuracy (by 5.3%, 2.9%, and 3.9% for pipelines A, B, and C compared to a benchmark model) while notably reducing the number of utilized features (by 94.7%, 88.3%, and 89.4%), enhancing the models' interpretability. Furthermore, adding features extracted from chilled-water consumption data boosts the accuracy (with respect to the baseline) for the second subset by 12.4%, 13.5%, and 7.2%, while decreasing the feature count by 97.2%, 96.4%, and 96.5%, respectively.
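
As a rough illustration of the feature generation step mentioned in the abstract, the sketch below condenses an hourly consumption series into a few summary features. The abstract does not list the features the authors actually used, so the function name `extract_load_features`, the feature names, and the 08:00 to 18:00 "daytime" window are all assumptions made for this example. The feature selection and pipeline optimization steps are not shown.

```python
from statistics import mean


def _mean_ratio(numerator_values, denominator_values):
    """Ratio of two group means; NaN when a group is empty or the denominator is zero."""
    if not numerator_values or not denominator_values:
        return float("nan")
    denominator = mean(denominator_values)
    return mean(numerator_values) / denominator if denominator else float("nan")


def extract_load_features(hourly_load, start_weekday=0):
    """Summarise an hourly load series into illustrative features.

    hourly_load   : readings covering a whole number of days, starting at 00:00
    start_weekday : weekday of the first day (0 = Monday), an assumed convention
    All feature names and thresholds here are hypothetical, not the paper's.
    """
    if not hourly_load or len(hourly_load) % 24 != 0:
        raise ValueError("expected a non-empty series covering whole days")

    day, night, weekday, weekend = [], [], [], []
    for index, value in enumerate(hourly_load):
        hour = index % 24
        day_of_week = (start_weekday + index // 24) % 7
        # Assumed daytime window: 08:00 to 18:00.
        (day if 8 <= hour < 18 else night).append(value)
        # Saturday (5) and Sunday (6) count as weekend.
        (weekend if day_of_week >= 5 else weekday).append(value)

    average = mean(hourly_load)
    return {
        "mean_load": average,
        "peak_to_mean": max(hourly_load) / average if average else float("nan"),
        "night_to_day": _mean_ratio(night, day),
        "weekend_to_weekday": _mean_ratio(weekend, weekday),
    }


# Synthetic office-like week starting on a Monday: 10.0 during weekday
# working hours, 2.0 otherwise. These values are made up for the demo.
week = [
    10.0 if (d < 5 and 8 <= h < 18) else 2.0
    for d in range(7)
    for h in range(24)
]
features = extract_load_features(week)
print(features)
```

Ratio-style features such as these stay comparable across buildings of different sizes, which is one plausible reason to prefer them over raw magnitudes when classifying use type or operation pattern.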
