4.7 Article

Imbalanced fault diagnosis based on semi-supervised ensemble learning

Journal

JOURNAL OF INTELLIGENT MANUFACTURING
Volume 34, Issue 7, Pages 3143-3158

Publisher

SPRINGER
DOI: 10.1007/s10845-022-01985-2

Keywords

Imbalanced fault diagnosis; Ensemble learning; Oversampling; Semi-supervised learning

Ask authors/readers for more resources

This study proposes a semi-supervised ensemble learning method for imbalanced fault diagnosis. It evaluates sample information and presents a novel synthetic minority oversampling technique to balance the labeled dataset. It also utilizes co-training technique to exploit information from the unlabeled dataset, improving the performance of fault diagnosis.
The imbalance of fault modes prevails in industrial equipment monitoring. Many methods were presented for imbalanced fault diagnosis only by resampling labeled fault dataset, which limited the diagnostic performance due to information loss from unlabeled fault dataset. To perfectly exploit the information from unlabeled and labeled datasets, this study proposed a semi-supervised ensemble learning method termed as SSTI for imbalanced fault diagnosis. First, the sample information was evaluated based on Mahalanobis distance, and a novel sample information-based synthetic minority oversampling technique (SI-SMOTE) was presented for balancing the labeled dataset. Second, the tri-training architecture-based imbalanced co-training technique (Tri-ImCT) was developed to exploit the information contained in the unlabeled dataset. In the Tri-ImCT, rebalancing the training subsets and variable weighted voting were utilized to improve the performance of proposed method for imbalanced fault diagnosis. To verify the performance of proposed method, several experiments were carried out on several imbalanced datasets derived from two bearing datasets and one subway wheel dataset. We utilized three indicators of G-mean, average precision, and average F-score for evaluating the performance of classifiers. Experimental results show that the performance of proposed method exceeds that of other methods, which is very close to the upper bound of fully-supervised performance. It substantially indicates that this study provides a very promising methodology for imbalanced fault diagnosis.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available