4.8 Article

Semisupervised Robust Modeling of Multimode Industrial Processes for Quality Variable Prediction Based on Student's t Mixture Model

Journal

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS
Volume 16, Issue 5, Pages 2965-2976

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TII.2019.2902560

Keywords

Expectation-maximization (EM); multimode industrial process; robust modeling; semisupervised learning; soft sensor; Student's t mixture model (SMM)

Funding

  1. National Natural Science Foundation of China [61703367]
  2. Natural Science Foundation of Zhejiang Province [LR18F030001]
  3. China Postdoctoral Science Foundation [2017M621929]
  4. Fundamental Research Funds for the Central Universities [2018XZZX002-09]

Ask authors/readers for more resources

Gaussian mixture model (GMM) has been widely used for soft sensor modeling of multimode industrial processes. However, it has been recognized that the performance of GMM deteriorates with the presence of outliers, which commonly exist in industrial datasets. In addition, samples with known labels in soft sensor applications are often rare because of expensive sampling equipment or time-consuming laboratory analysis. Shortage of labeled samples could lead GMM-based models to low prediction accuracy. To tackle such problems, a semisupervised robust soft sensor modeling method called semisupervised Student's t mixture model (SsSMM) is proposed. Like the GMM, the SsSMM employs finite mixture models to learn data distributions; nevertheless, with the virtual of the long tail property of Student's t distribution, the SsSMM possesses stronger robustness against outliers compared with the GMM. Moreover, the semisupervised model structure of SsSMM enables exploiting unlabeled samples of the SsSMM, such that the issues caused by insufficient labeled samples can be tackled. To identify model parameters of the SsSMM, we also develop an expectation-maximization-based training algorithm. Experimental results on numerical and industrial examples demonstrate that the proposed method is effective in: first, modeling multimode characteristics; second, exploiting unlabeled samples for performance improvement; third, dealing with distinct outliers (in synthetic dataset) and indistinctive outliers (in industrial dataset).

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available