Article

Wind Turbine Fault Detection Using Highly Imbalanced Real SCADA Data

Journal

ENERGIES
Volume 14, Issue 6

Publisher

MDPI
DOI: 10.3390/en14061728

Keywords

fault detection; machine learning; principal component analysis; SCADA; structural health monitoring; wind turbine; imbalanced data; support vector machines; k nearest neighbors

Funding

  1. Spanish Agencia Estatal de Investigacion (AEI)-Ministerio de Economia, Industria y Competitividad (MINECO)
  2. Fondo Europeo de Desarrollo Regional (FEDER) [DPI2017-82930-C2-1-R]
  3. Generalitat de Catalunya [2017 SGR 388]
  4. NVIDIA Corporation

Wind power is cleaner and cheaper than other energy sources, but challenges in the operation and maintenance of wind farms increase costs. This paper proposes a fault detection methodology that applies data analysis and processing techniques to real SCADA data to improve alarm detection for wind turbine gearboxes.
Wind power is cleaner and less expensive than other alternative sources, and it has therefore become one of the most important energy sources worldwide. However, challenges related to the operation and maintenance of wind farms contribute significantly to their overall costs, so it is necessary to monitor the condition of each wind turbine on the farm and identify its different alarm states. Common alarms are raised based on data acquired by a supervisory control and data acquisition (SCADA) system; however, this system generates a large number of false positive alerts, which must be handled to minimize inspection costs and to perform preventive maintenance before critical or catastrophic failures occur. To this end, this paper proposes a fault detection methodology in which different data analysis and data processing techniques are applied to real, imbalanced SCADA data to improve the detection of alarms related to the temperature of the main gearbox of a wind turbine. An imbalanced dataset is a classification dataset with skewed class proportions (more observations from one class than from the other), which can bias a classifier if it is not handled with care. Furthermore, the dataset is time dependent, which introduces an additional constraint on how the data are processed and split. The methods aim to reduce false positives and false negatives, and to demonstrate that well-applied preprocessing techniques improve the performance of different machine learning algorithms.
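The two data issues named in the abstract, class imbalance and time dependence, can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the labels are synthetic, the helper names `chronological_split` and `confusion_counts` are hypothetical, and the "classifier" is a trivial baseline that always predicts the healthy class. It only shows why a time-ordered split keeps future samples out of training and why accuracy alone can hide missed alarms.

```python
from typing import List, Sequence, Tuple


def chronological_split(n_samples: int, n_train: int) -> Tuple[List[int], List[int]]:
    """Split indices so every training sample precedes every test sample in time."""
    return list(range(n_train)), list(range(n_train, n_samples))


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[int, int, int, int]:
    """Return (tp, fp, fn, tn), where 1 = alarm and 0 = healthy."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fp, fn, tn


# Synthetic, time-ordered labels (an assumption for illustration only):
# 100 samples, with one alarm in every 20 samples, so 5% belong to the alarm class.
labels = [1 if i % 20 == 19 else 0 for i in range(100)]

# The first 70 samples are used for training, the last 30 for testing; no shuffling.
train_idx, test_idx = chronological_split(len(labels), 70)
y_test = [labels[i] for i in test_idx]

# Trivial baseline: always predict "healthy".
y_pred = [0] * len(y_test)

tp, fp, fn, tn = confusion_counts(y_test, y_pred)
accuracy = (tp + tn) / len(y_test)
recall = tp / (tp + fn) if (tp + fn) else 0.0

print(f"accuracy={accuracy:.3f}, recall={recall:.3f}, missed alarms={fn}")
```

On this synthetic series the baseline scores a high accuracy while detecting none of the alarms in the test window, which is why false positives and false negatives are the quantities worth tracking on imbalanced data.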
