4.7 Article

A machine learning framework to improve effluent quality control in wastewater treatment plants

Journal

SCIENCE OF THE TOTAL ENVIRONMENT
Volume 784, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.scitotenv.2021.147138

Keywords

Wastewater treatment; Big data; Interpretable AI; Effluent quality; Process analytics

Funding

  1. Green TEE platform

Ask authors/readers for more resources

This paper presents a novel ML-based framework designed to improve effluent quality control in WWTPs by clarifying the relationships between operational variables and effluent parameters. The study shows that influent temperature has a significant impact on TSSe and PO4(e), but in different ways; PO4(e) strongly depends on the TSS concentration in aeration basins, and the impact of TSS in aeration basins on effluent parameters increases with the distances of the basin from the merging outlet; Returning excessive amounts of sludge through the second return sludge pipe should be avoided due to its adverse impact on effluent quality.
Due to the intrinsic complexity of wastewater treatment plant (WWTP) processes, it is always challenging to respond promptly and appropriately to the dynamic process conditions in order to ensure the quality of the effluent, especially when operational cost is a major concern. Machine Learning (ML) methods have therefore been used to model WWTP processes in order to avoid various shortcomings of conventional mechanistic models. However, to the best of the authors' knowledge, no ML applications have focused on investigating how operational factors can affect effluent quality. Additionally, the time lags between process steps have always been neglected, making it difficult to explain the relationships between operational factors and effluent quality. Therefore, this paper presents a novel ML-based framework designed to improve effluent quality control in WWTPs by clarifying the relationships between operational variables and effluent parameters. The framework consists of Random Forest (RF) models, Deep Neural Network (DNN) models, Variable Importance Measure (VIM) analyses, and Partial Dependence Plot (PDP) analyses, and uses a novel approach to account for the impact of time lags between processes. Details of the framework are provided along with a demonstration of its practical applicability based on a case study of the lima WWTP in Sweden involving a large number of samples (105763) representing the full scale of the plant's operations. Two effluent parameters, Total Suspended Solids in effluent (TSSe) and Phosphate in effluent (PO4(e)), and thirty-two operational variables are studied. RF models are developed, validated using DNN models as references, and shown to be suitable for VIM and PDP analyses. VIM identifies the variables that most strongly influence TSSe and PO4(e), while PDP elucidates their specific effects on TSSe, and PO4(e). The major findings are: (1) Influent temperature is the most influential variable for both TSSe and PO4(e), but it affects them in different ways; (2) PO4(e) depends strongly on the TSS in aeration basins - higher TSS concentrations in aeration basins generally promote PO4 removal, but excess TSS can have negative effects; (3) In general, the impact of TSS in aeration basins on TSSe and PO4(e) increases with the distances of the basin from the merging outlet, so more attention should be paid to the TSS concentration in the third or fourth aeration basins than the first and second ones; (4) Returning excessive amounts of sludge through the second return sludge pipe should be avoided because of its adverse impact on TSSe removal. These results could support the development of more advanced control strategies to increase control precision and reduce running costs in the Umea WWTP and other similarly configured WWTPs. The framework could also be applied to other parameters in WWTPs and industrial processes in general if sufficient high-resolution data are available. (C) 2021 The Authors. Published by Elsevier B.V.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available