4.6 Article

Gated Stacked Target-Related Autoencoder: A Novel Deep Feature Extraction and Layerwise Ensemble Method for Industrial Soft Sensor Application

Journal

IEEE TRANSACTIONS ON CYBERNETICS
Volume 52, Issue 5, Pages 3457-3468

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCYB.2020.3010331

Keywords

Logic gates; Feature extraction; Neurons; Data mining; Computational modeling; Process control; Data models; Deep learning (DL); gated neurons; nonlinear feature extraction; soft sensor; stacked autoencoder (SAE); target-related information

Funding

  1. National Natural Science Foundation of China [61722310]
  2. Natural Science Foundation of Zhejiang Province [LR18F030001]

Ask authors/readers for more resources

In this research, the authors aim to address the challenge of extracting effective feature representations from complex process data in the field of soft sensing applications. They propose a deep stacked autoencoder (SAE) approach and introduce a novel gated stacked target-related autoencoder (GSTAE) to improve modeling performance.
These days, data-driven soft sensors have been widely applied to estimate the difficult-to-measure quality variables in the industrial process. How to extract effective feature representations from complex process data is still the difficult and hot spot in the soft sensing application field. Deep learning (DL), which has made great progresses in many fields recently, has been used for process monitoring and quality prediction purposes for its outstanding nonlinear modeling and feature extraction abilities. In this work, deep stacked autoencoder (SAE) is introduced to construct a soft sensor model. Nevertheless, conventional SAE-based methods do not take information related to target values in the pretraining stage and just use the feature representations in the last hidden layer for final prediction. To this end, a novel gated stacked target-related autoencoder (GSTAE) is proposed for improving modeling performance in view of the above two issues. By adding prediction errors of target values into the loss function when executing a layerwise pretraining procedure, the target-related information is used to guide the feature learning process. Besides, gated neurons are utilized to control the information flow from different layers to the final output neuron that take full advantage of different levels of abstraction representations and quantify their contributions. Finally, the effectiveness and feasibility of the proposed approach are verified in two real industrial cases.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available