4.0 Article

Predicting Optical Water Quality Indicators from Remote Sensing Using Machine Learning Algorithms in Tropical Highlands of Ethiopia

期刊

HYDROLOGY
卷 10, 期 5, 页码 -

出版社

MDPI
DOI: 10.3390/hydrology10050110

关键词

water quality; Landsat; machine learning; Lake Tana

向作者/读者索取更多资源

This study explored the use of remote sensing imagery and machine learning algorithms to monitor water quality, predicting three water quality indicators. The results showed that XGB algorithm performed best in predicting Chlorophyll a, while RF algorithm performed best in predicting total dissolved solids and turbidity. The study demonstrates the possibility of using remote sensing and machine learning algorithms to monitor water quality in large freshwater bodies with limited data.
Water quality degradation of freshwater bodies is a concern worldwide, particularly in Africa, where data are scarce and standard water quality monitoring is expensive. This study explored the use of remote sensing imagery and machine learning (ML) algorithms as an alternative to standard field measuring for monitoring water quality in large and remote areas constrained by logistics and finance. Six machine learning (ML) algorithms integrated with Landsat 8 imagery were evaluated for their accuracy in predicting three optically active water quality indicators observed monthly in the period from August 2016 to April 2022: turbidity (TUR), total dissolved solids (TDS) and Chlorophyll a (Chl-a). The six ML algorithms studied were the artificial neural network (ANN), support vector machine regression (SVM), random forest regression (RF), XGBoost regression (XGB), AdaBoost regression (AB), and gradient boosting regression (GB) algorithms. XGB performed best at predicting Chl-a, with an R-2 of 0.78, Nash-Sutcliffe efficiency (NSE) of 0.78, mean absolute relative error (MARE) of 0.082 and root mean squared error (RMSE) of 9.79 mu g/L. RF performed best at predicting TDS (with an R-2 of 0.79, NSE of 0.80, MARE of 0.082, and RMSE of 12.30 mg/L) and TUR (with an R-2 of 0.80, NSE of 0.81, and MARE of 0.072 and RMSE of 7.82 NTU). The main challenges were data size, sampling frequency, and sampling resolution. To overcome the data limitation, we used a K-fold cross validation technique that could obtain the most out of the limited data to build a robust model. Furthermore, we also employed stratified sampling techniques to improve the ML modeling for turbidity. Thus, this study shows the possibility of monitoring water quality in large freshwater bodies with limited observed data using remote sensing integrated with ML algorithms, potentially enhancing decision making.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.0
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据