4.6 Article

An Approach to Data Acquisition for Urban Building Energy Modeling Using a Gaussian Mixture Model and Expectation-Maximization Algorithm

Journal

BUILDINGS
Volume 11, Issue 1, Pages -

Publisher

MDPI
DOI: 10.3390/buildings11010030

Keywords

gaussian mixture model; Expectation-Maximization; urban building energy modeling; data acquisition

Funding

  1. IMMA research network between Microdata Analysis
  2. Energy and Built Environments in the School of Technology and Business Studies at Dalarna University
  3. UBMEM project at the Swedish Energy Agency [46068]

Ask authors/readers for more resources

In recent years, uncertainties in building energy performance have increased, posing challenges in data collection and establishing data pipelines at an urban scale. However, the use of Gaussian mixture models and Expectation-Maximization algorithm to generate synthetic data points has shown consistent representation with real data.
In recent years, a building's energy performance is becoming uncertain because of factors such as climate change, the Covid-19 pandemic, stochastic occupant behavior and inefficient building control systems. Sufficient measurement data is essential to predict and manage a building's performance levels. Assessing energy performance of buildings at an urban scale requires even larger data samples in order to perform an accurate analysis at an aggregated level. However, data are not only expensive, but it can also be a real challenge for communities to acquire large amounts of real energy data. This is despite the fact that inadequate knowledge of a full population will lead to biased learning and the failure to establish a data pipeline. Thus, this paper proposes a Gaussian mixture model (GMM) with an Expectation-Maximization (EM) algorithm that will produce synthetic building energy data. This method is tested on real datasets. The results show that the parameter estimates from the model are stable and close to the true values. The bivariate model gives better performance in classification accuracy. Synthetic data points generated by the models show a consistent representation of the real data. The approach developed here can be useful for building simulations and optimizations with spatio-temporal mapping.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available