4.6 Article

Big Machinery Data Preprocessing Methodology for Data-Driven Models in Prognostics and Health Management

Journal

SENSORS
Volume 21, Issue 20, Pages -

Publisher

MDPI
DOI: 10.3390/s21206841

Keywords

data preprocessing; machinery data; big data; monitoring sensor data; data reduction; machine learning; deep learning; diagnosis; prognostics and health management

Funding

  1. Petroleum Institute, Khalifa University of Science and Technology, Abu Dhabi, UAE

Ask authors/readers for more resources

This paper discusses the impact of sensor monitoring networks and big data analytics on reliability engineering, as well as the application of data-driven models for diagnostics and prognostics in complex systems. It also highlights the lack of attention to data preprocessing steps in current research for real operating systems.
Sensor monitoring networks and advances in big data analytics have guided the reliability engineering landscape to a new era of big machinery data. Low-cost sensors, along with the evolution of the internet of things and industry 4.0, have resulted in rich databases that can be analyzed through prognostics and health management (PHM) frameworks. Several data-driven models (DDMs) have been proposed and applied for diagnostics and prognostics purposes in complex systems. However, many of these models are developed using simulated or experimental data sets, and there is still a knowledge gap for applications in real operating systems. Furthermore, little attention has been given to the required data preprocessing steps compared to the training processes of these DDMs. Up to date, research works do not follow a formal and consistent data preprocessing guideline for PHM applications. This paper presents a comprehensive step-by-step pipeline for the preprocessing of monitoring data from complex systems aimed for DDMs. The importance of expert knowledge is discussed in the context of data selection and label generation. Two case studies are presented for validation, with the end goal of creating clean data sets with healthy and unhealthy labels that are then used to train machinery health state classifiers.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available