4.8 Article

Imbalanced Classification Based on Minority Clustering Synthetic Minority Oversampling Technique With Wind Turbine Fault Detection Application

Journal

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS
Volume 17, Issue 9, Pages 5867-5875

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TII.2020.3046566

Keywords

Classification; imbalanced data; K-means algorithm; synthetic minority oversampling technique (SMOTE); wind turbine fault detection

Funding

  1. National Natural Science Foundation of China [61973119]
  2. Shanghai Rising-Star Program [20QA1402600]
  3. Programme of Introducing Talents of Discipline to Universities (the 111 Project) [B17017]

Ask authors/readers for more resources

The article proposed a minority clustering SMOTE (MC-SMOTE) method to improve the imbalance classification performance and verified its superiority through experiments on benchmark datasets and real industrial data.
Synthetic minority oversampling technique (SMOTE) has been widely used in dealing with the imbalance classification problem in the machine learning field. However, classical SMOTE implements the oversampling by linear interpolation between adjacent minority class samples, which may fail to consider the uneven distribution of the samples. This article proposes a minority clustering SMOTE (MC-SMOTE) method that involves the clustering of minority class samples to improve the imbalance classification performance. First, samples from the minority class are clustered into several clusters. Second, oversampling is performed by linear interpolation between adjacent clusters to create new samples from different clusters that contain additional information of the entire minority class. Then classical classification techniques can be employed to achieve efficient classification. The superiority of the MC-SMOTE is first verified by experiments on some benchmark datasets from various application domains. The proposed method is then applied to the real industrial SCADA data of wind turbine blade icing. Classification results indicate that the MC-SMOTE exhibits a better performance than that of the classical SMOTE.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available