4.7 Article

A novel graph-based missing values imputation method for industrial lubricant data

期刊

COMPUTERS IN INDUSTRY
卷 150, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.compind.2023.103937

关键词

Lubricant formulation; Missing values; Imputation method; Graph Convolutional Network

向作者/读者索取更多资源

Missing values in lubricant formulation data are common due to the complexity of lubricant manufacturing. To obtain meaningful information, such as correlations and patterns, it is crucial to impute missing values using statistical analysis and data mining. Traditional methods neglect the latent structure and only consider feature information, while the proposed graph-based imputation method (GBIM) considers both feature information and data point relations, achieving higher imputation accuracy compared to traditional methods.
Missing values are unavoidable in lubricant formulation data in the chemical industry owing to the complexity of lubricant manufacturing. Therefore, imputing missing values using statistical analysis and data mining is essential to obtain meaningful information such as correlations and patterns. Traditional methods, such as random forest (RF), k-nearest neighbors (k-NN), support vector machine (SVM), and deep neural networks (DNNs), have been employed for imputing missing values. However, these traditional methods neglect the latent structure because they only consider the feature information of the data. To this end, this study proposed a novel graph-based imputation method (GBIM) considering the feature information and the relations between data points to improve model performance. The proposed GBIM expresses the relation between each data point via a graph by consulting with dependency modeling and imputes missing values using a graph convolutional network (GCN). Experiments were performed for four physical properties in a lubricant formulation dataset. The results using GBIM were compared with those of traditional imputation methods (RF, k-NN, SVM, and DNN) by considering missing rates at 5% intervals from 5% to 50%. GBIM achieved 4-7% higher imputation accuracy than the other methods. The proposed GBIM can be applied in various industries as a powerful method for imputing missing values.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据