4.7 Review

Systematic review of class imbalance problems in manufacturing

Journal

JOURNAL OF MANUFACTURING SYSTEMS
Volume 71, Issue -, Pages 620-644

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.jmsy.2023.10.014

Keywords

Manufacturing; Class imbalance; Data manipulation; Machine learning; Deep learning

Ask authors/readers for more resources

This article provides a systematic review of data manipulation, machine learning, and deep learning solutions to the class imbalance problem in the manufacturing domain. It critically evaluates different metrics and explores the availability of public source code and imbalanced datasets for benchmarking. Furthermore, it summarizes the most applied solutions to the class imbalance problem in manufacturing and discusses future challenges.
Class imbalance (CI) is a well-known problem in data science. Nowadays, it is affecting the data modeling of many of the real-world processes that are being digitized. The manufacturing industry turns out to be highly affected by this problem, especially in fault inspection, prediction or monitoring processes, and in all those processes where the production efficiency is high and the data samples of anomalous events are rare. In this work, we systematically review all the data manipulation, machine learning or deep learning solutions to the CI problem in the manufacturing domain. We also critically evaluate all the different metrics that researchers can compare in order to estimate the improvements carried by their proposed solutions, and we look at the availability of public source code and data-imbalanced datasets that can be used for benchmarking. Finally, we summarize the most applied solutions to the CI problem in manufacturing and we look at future challenges. While posing a reference for the best practices at the time of this review, we challenge researchers to standardize the use of data science algorithms for CI in the manufacturing domain.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available