4.7 Article

Comparative study of feature selection methods for wind speed estimation at ungauged locations

Journal

ENERGY CONVERSION AND MANAGEMENT
Volume 291, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.enconman.2023.117324

Keywords

Exceedance probability; Feature selection; Machine learning; Topographic feature; Ungauged location; Wind speed

Ask authors/readers for more resources

This paper evaluates six feature selection methods for estimating different wind speed quantiles across Canada. The results show that LASSO and MRMR are the most efficient algorithms with few parameters to tune and good generalization performance. The study also finds that certain predictors are more important for specific exceedance probabilities, and distance from the coast and surface roughness length are the most important predictors regardless of exceedance probability.
Wind speed estimation at ungauged locations is one of the preliminary steps for wind resource assessment. With the availability of high-resolution Digital Elevation Models (DEM) and remote sensing data, the number of potential wind speed predictors has grown substantially. The adequate spatial scale of these predictors is unknown a priori, leading to the use of multiple spatial scales of predictors in wind speed estimation models. Implementing a feature selection method as a pre-processing step of the analysis is necessary to avoid overfitting and the resulting potential model underperformance. This paper evaluated six feature selection methods (forward stepwise regression, Least Absolute Shrinkage and Selection Operator (LASSO), Elastic Net, maximum relevance minimum redundancy (MRMR), Genetic algorithm, and recursive feature elimination using support vector regression) for the estimation of different wind speed quantiles across Canada. The selected features were used to fit a regression-kriging model, and the importance of the predictors was evaluated with their associated regression coefficients. The results of the study showed that LASSO and MRMR are the most efficient algorithms with the least number of parameters to tune and good generalization performance. The study found that some predictors were more important for specific exceedance probabilities. The most important predictors were the distance from the coast and surface roughness length, regardless of exceedance probability.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available