4.7 Article

River water quality index prediction and uncertainty analysis: A comparative study of machine learning models

Journal

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.jece.2020.104599

Keywords

Water quality index; River water quality; Lam Tsuen river; Ensemble machine learning

Ask authors/readers for more resources

The study introduces a new ensemble machine learning model, Extra Tree Regression (ETR), for predicting monthly Water Quality Index (WQI) values at the Lam Tsuen River in Hong Kong. The ETR model generally produces more accurate WQI predictions for both training and testing phases, demonstrating improvement in prediction performance and reduction in the number of input parameters compared to previous approaches.
The Water Quality Index (WQI) is the most common indicator to characterize surface water quality. This study introduces a new ensemble machine learning model called Extra Tree Regression (ETR) for predicting monthly WQI values at the Lam Tsuen River in Hong Kong. The ETR model performance is compared with that of the classic standalone models, Support Vector Regression (SVR) and Decision Tree Regression (DTR). The monthly input water quality data including Biochemical Oxygen Demand (BOD), Chemical Oxygen Demand (COD), Dissolved Oxygen (DO), Electrical Conductivity (EC), Nitrate-Nitrogen (NO3 -N), Nitrite-Nitrogen (NO2 -N), Phosphate (PO43-), potential for Hydrogen (pH), Temperature (T) and Turbidity (TUR) are used for building the prediction models. Various input data combinations are investigated and assessed in terms of prediction performance, using numerical indices and graphical comparisons. The analysis shows that the ETR model generally produces more accurate WQI predictions for both training and testing phases. Although including all the ten input variables achieves the highest prediction performance (R-test(2) = 0.98, RMSEtest = 2.99), a combination of input parameters including only BOD, Turbidity and Phosphate concentration provides the second highest prediction accuracy (R-test(2) = 0.97, RMSEtest = 3.74). The uncertainty analysis relative to model structure and input parameters highlights a higher sensitivity of the prediction results to the former. In general, the ETR model represents an improvement on previous approaches for WQI prediction, in terms of prediction performance and reduction in the number of input parameters.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available