4.7 Article

Ensemble Boosting and Bagging Based Machine Learning Models for Groundwater Potential Prediction

期刊

WATER RESOURCES MANAGEMENT
卷 35, 期 1, 页码 23-37

出版社

SPRINGER
DOI: 10.1007/s11269-020-02704-3

关键词

Groundwater potential prediction; Ensemble machine learning models; Decision tree; Boosting; Bagging; Recursive feature elimination

资金

  1. Alexander von Humboldt Foundation

向作者/读者索取更多资源

The study aims to evaluate four ensemble models for groundwater potential prediction, with RF model outperforming the others. Key contributing variables include topographic position index and valley depth. The predicted groundwater potential maps can assist water managers and policymakers in optimal freshwater exploitation.
Due to the rapidly increasing demand for groundwater, as one of the principal freshwater resources, there is an urge to advance novel prediction systems to more accurately estimate the groundwater potential for an informed groundwater resource management. Ensemble machine learning methods are generally reported to produce more accurate results. However, proposing the novel ensemble models along with running comparative studies for performance evaluation of these models would be equally essential to precisely identify the suitable methods. Thus, the current study is designed to provide knowledge on the performance of the four ensemble models i.e., Boosted generalized additive model (GamBoost), adaptive Boosting classification trees (AdaBoost), Bagged classification and regression trees (Bagged CART), and random forest (RF). To build the models, 339 groundwater resources' locations and the spatial groundwater potential conditioning factors were used. Thereafter, the recursive feature elimination (RFE) method was applied to identify the key features. The RFE specified that the best number of features for groundwater potential modeling was 12 variables among 15 (with a mean Accuracy of about 0.84). The modeling results indicated that the Bagging models (i.e., RF and Bagged CART) had a higher performance than the Boosting models (i.e., AdaBoost and GamBoost). Overall, the RF model outperformed the other models (with accuracy = 0.86, Kappa = 0.67, Precision = 0.85, and Recall = 0.91). Also, the topographic position index's predictive variables, valley depth, drainage density, elevation, and distance from stream had the highest contribution in the modeling process. Groundwater potential maps predicted in this study can help water resources managers and policymakers in the fields of watershed and aquifer management to preserve an optimal exploit from this important freshwater.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据