4.6 Article

Enhanced performance Gaussian process regression for probabilistic short-term solar output forecast

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.ijepes.2021.106916

Keywords

Short-term forecasting; Photovoltaic; Gaussian Processes Regression; k-means; Feature selection

Ask authors/readers for more resources

This paper proposes a novel probabilistic framework to predict short-term PV output, taking into account the variability of weather data over different seasons. By using feature selection, clustering, and Gaussian Process Regression, a function relating selected features with solar output is established. Testing on five solar generation plants shows that the proposed method significantly reduces errors compared to existing methodologies.
With increasing concerns of climate change, renewable resources such as photovoltaic (PV) have gained popularity as a means of energy generation. The smooth integration of such resources in power system operations is enabled by accurate forecasting mechanisms that address their inherent intermittency and variability. This paper proposes a novel probabilistic framework to predict short-term PV output taking into account the variability of weather data over different seasons. To this end, we go beyond existing prediction methods, building a pipeline of processes, i.e., feature selection, clustering and Gaussian Process Regression (GPR). We make use of datasets that comprise of power output and meteorological data such as irradiance, temperature, zenith, and azimuth. First, a correlation study is performed to select the weather features which affect solar output to a greater extent. Next, we categorise the data into four groups based on solar output and time by using k-means clustering. Finally, we determine a function that relates the aforementioned selected features with solar output by using GPR and Mate?rn 5/2 as a kernel function. We validate our method with five solar generation plants in different locations and compare the results with existing methodologies. More specifically, in order to test the proposed model, two different methods are used: (i) a 5-fold cross validation; and (ii) holding out 30 random days as test data. To confirm the model accuracy, we apply our framework 30 independent times on each of the four clusters. The average error follows a normal distribution, and with 95% confidence level, it takes values between -1.6% and 1.4%. The proposed framework decreases the normalised root mean square error and mean absolute error by 54.6% and 55.5%, respectively, when compared with other relevant works.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available