4.7 Article

Machine learning-based estimation of buildings' characteristics employing electrical and chilled water consumption data: Pipeline optimization

Journal

ENERGY AND BUILDINGS
Volume 295, Issue -, Pages -

Publisher

ELSEVIER SCIENCE SA
DOI: 10.1016/j.enbuild.2023.113327

Keywords

Smart meter; Commercial buildings classification; Machine learning; Feature extraction; Feature selection; Pipeline optimization

Ask authors/readers for more resources

Smart meter-driven remote auditing allows rapid identification of buildings with low energy performance. This study focuses on using ML-based pipelines to characterize buildings and optimize their performance using electrical and chilled-water consumption data. Results show that optimizing the pipelines improves model accuracy and interpretability, and adding features from chilled-water consumption data further enhances accuracy and reduces feature count.
Smart meter-driven remote auditing of buildings, as an alternative to the labor-intensive on-site visits, permits large-scale and rapid identification of buildings with low energy performance. The existing literature has mainly focused on electricity meters' data from a rather small set of buildings and efforts have often not been made to facilitate the models' physical interpretability. Accordingly, the present work focuses on the implementation and optimization of ML-based pipelines for building characterization (by use type (A), performance class (B), and operation group (C)) employing hourly electrical and chilled-water consumption data. Utilizing the Building Data Genome Project II dataset (with data from 1636 buildings), feature generation, feature selection, and pipeline optimization steps are performed for each pipeline. Results demonstrate that performing the latter two steps improves the model's accuracy (5.3%, 2.9%, and 3.9% for pipelines A, B, and C compared to a benchmark model), while notably reduces the number of utilized features (94.7%, 88.3%, 89.4%), enhancing the models' interpretability. Furthermore, adding features extracted from chilled-water consumption data boosts the accuracy (with respect to baseline) for the second subset by 12.4%, 13.5%, and 7.2%, while decreasing the feature count by 97.2%, 96.4%, and 96.5%, respectively.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available