4.8 Article

Water quality prediction and classification based on principal component regression and gradient boosting classifier approach

Publisher

ELSEVIER
DOI: 10.1016/j.jksuci.2021.06.003

Keywords

Water quality index; Principal component regression; Classification algorithm; Boxplot analysis

Ask authors/readers for more resources

This paper presents a water quality prediction model using principal component regression technique and gradient boosting classifier. The model achieves accurate prediction of water quality index and classification of water quality status through principal component analysis and parameter extraction.
Estimating water quality has been one of the significant challenges faced by the world in recent decades. This paper presents a water quality prediction model utilizing the principal component regression tech-nique. Firstly, the water quality index (WQI) is calculated using the weighted arithmetic index method. Secondly, the principal component analysis (PCA) is applied to the dataset, and the most dominant WQI parameters have been extracted. Thirdly, to predict the WQI, different regression algorithms are used to the PCA output. Finally, the Gradient Boosting Classifier is utilized to classify the water quality status. The proposed system is experimentally evaluated on a Gulshan Lake-related dataset. The results demonstrate 95% prediction accuracy for the principal component regression method and 100% classification accuracy for the Gradient Boosting Classifier method, which show credible performance compared with the state -of-art models. (c) 2021 The Authors. Published by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available