4.7 Article

Developing stacking ensemble models for multivariate contamination detection in water distribution systems

Journal

SCIENCE OF THE TOTAL ENVIRONMENT
Volume 828, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.scitotenv.2022.154284

Keywords

Contamination detection; Ensemble modeling; Machine learning; Stacking modeling; Water distribution system; Water quality

Funding

  1. National Key Research and Develop-ment Program of China [2019YFC1407701]
  2. National Natural Science Foundation of China [51925902, 52079016, 52122901]
  3. Liaoning Natural Science Foundation [2019-MS-043]
  4. Funda-mental Research Funds for the Central Universities [DUT21GJ203]
  5. UK Royal Society [IF160108, IEC\NSFC\170249]
  6. Alan Turing Institute under the EPSRC [EP/N510129/1]
  7. China Scholarship Council

Ask authors/readers for more resources

This study presents a new stacking ensemble model that uses multiple water quality parameters for contamination event detection. The proposed method outperforms an artificial neural network (ANN) benchmark method in terms of detection accuracy. The results demonstrate the great potential of the stacking method for detecting contamination events in water distribution systems.
This study presents a new stacking ensemble model for contamination event detection using multiple water quality parameters. The stacking model consists of a number of machine learning base predictors and a meta-predictor, and it is trained using cross-validation to capture different features in multiple water quality parameters and then used for water quality predictions. For each water quality parameter, the residuals between predicted and measured data arc classified to identify anomalies with thresholds derived from the sequential model-based optimization method and detection probabilities updated using Bayesian analysis. Alarms derived from individual water quality parameters are fused to enhance the anomaly signals and improve the detection accuracy. The proposed stacking-based method is evaluated using a data set of six water quality parameters from a real water distribution system with randomly simulated events. The stacking-based method could detect 2496 events out of a total 2500 events without a false alarm. The results show that the stacking method outperforms an artificial neural network (ANN) benchmark method in contamination event detection. The stacking method has a higher true positive rate, lower false positive rate and higher F1 score than the ANN method. This implies that the stacking method has great promise of detecting contamination events in the water distribution system.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available