☆ 4.6 Article

Training Data Selection by Categorical Variables for Better Rare Event Prediction in Multiple Products Production Line

ELECTRONICS (2022)

期刊

ELECTRONICS

卷 11, 期 7, 页码 -

出版社

MDPI

DOI: 10.3390/electronics11071056

关键词

multivariate time series; categorical variables; Euclidian distance matrix; integrated feature representation; autoencoder

类别

Computer Science, Information Systems Engineering, Electrical & Electronic Physics, Applied

资金

National Natural Science Foundation of China, China [51775108]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Manufacturers struggle to predict rare events using data from multiple products production lines, and there is little research on quantitatively selecting training data. This study proposes a training data selection method to improve the performance of deep learning models, which can measure the similarities between multivariate time series using categorical variables.

Manufacturers are struggling to use data from multiple products production lines to predict rare events. Improving the quality of training data is a common way to improve the performance of algorithms. However, there is little research about how to select training data quantitatively. In this study, a training data selection method is proposed to improve the performance of deep learning models. The proposed method can represent different time length multivariate time series spilt by categorical variables and measure the (dis)similarities by the distance matrix and clustering method. The contributions are: (1) The proposed method can find the changes to the training data caused by categorical variables in a multivariate time series dataset; (2) according to the proposed method, the multivariate time series data from the production line can be clustered into many small training datasets; and (3) same structure but different parameters prediction models are built instead of one model which is different from the traditional way. In practice, the proposed method is applied in a real multiple products production line dataset and the result shows it can not only significantly improve the performance of the reconstruction model but it can also quantitively measure the (dis)similarities of the production behaviors.

Training Data Selection by Categorical Variables for Better Rare Event Prediction in Multiple Products Production Line

期刊

ELECTRONICS

出版社

MDPI

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Training Data Selection by Categorical Variables for Better Rare Event Prediction in Multiple Products Production Line

期刊

ELECTRONICS

出版社

MDPI

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文