4.7 Article

Raw Wind Data Preprocessing: A Data-Mining Approach

Journal

IEEE TRANSACTIONS ON SUSTAINABLE ENERGY
Volume 6, Issue 1, Pages 11-19

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TSTE.2014.2355837

Keywords

Data mining; data preprocessing; local outlier factor (LOF); unsupervised learning

Funding

  1. National High Technology Research and Development Program of China [2011AA05A112]
  2. National Natural Science Foundation of China [51190101]
  3. Science and Technology Projects of the State Grid Corporation of China [SGHN0000DKJS1300221]
  4. Hunan Electric Power Corporation
  5. Ningxia Electric Power Corporation

Ask authors/readers for more resources

Wind energy integration research generally relies on complex sensors located at remote sites. The procedure for generating high-level synthetic information from databases containing large amounts of low-level data must therefore account for possible sensor failures and imperfect input data. The data input is highly sensitive to data quality. To address this problem, this paper presents an empirical methodology that can efficiently preprocess and filter the raw wind data using only aggregated active power output and the corresponding wind speed values at the wind farm. First, raw wind data properties are analyzed, and all the data are divided into six categories according to their attribute magnitudes from a statistical perspective. Next, the weighted distance, a novel concept of the degree of similarity between the individual objects in the wind database and the local outlier factor (LOF) algorithm, is incorporated to compute the outlier factor of every individual object, and this outlier factor is then used to assess which category an object belongs to. Finally, the methodology was tested successfully on the data collected from a large wind farm in northwest China.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available