4.7 Article

XGBoost-based method for flash flood risk assessment

期刊

JOURNAL OF HYDROLOGY
卷 598, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.jhydrol.2021.126382

关键词

Flash flood; Risk assessment; XGBoost; Yunnan

资金

  1. National Key R&D Program of China [2018YFC1508105]
  2. Natural Science Foundation Yalong River Joint Fund [U1865102]
  3. National Natural Science Foundation of China (NSFC) [41471430]
  4. Research on Key Technologies of Flash Flood Prevention Based on Multi-machine Learning Model [2019KJ086]

向作者/读者索取更多资源

This study introduced the XGBoost model for flash flood risk assessment and validated its effectiveness by combining with other models. Through attribution analysis and risk map generation, it successfully demonstrated the accuracy and practicality of XGBoost in risk assessment.
Flash flood risk assessment, a widely applied technology in preventing catastrophic flash flood disasters, has become the current research hotspot. However, most existing machine learning methods for assessing flash flood risk rely on a single classifier, which is suitable for processing small sets of sample data, but the resulting prediction accuracy and generalization ability are insufficient. Meanwhile, machine learning methods that integrate multiple classifiers are thus far unknown. Extreme Gradient Boosting (XGBoost) is an excellent algorithm for ensemble learning methods which has achieved remarkable results in many fields. It not only optimizes the algorithm but also automatically applies the CPU's multi-threading to perform parallel calculations, thus greatly improving the model training speed and prediction accuracy. Therefore, this article introduces the XGBoost model for the assessment of flash flood risk, and then combines the two input strategies and the Least squares support vector machine (LSSVM) model to verify its optimal effect, thus proposing the XGBoost-based method for flash flood risk assessment. Subsequently, an attribution analysis was implemented to assess the possible errors of this approach; and finally, a county-level flash flood risk map for Yunnan Province, China, was generated based on the proposed method. The results demonstrate that: (1) XGBoost performs well, with an accuracy of 0.84 in the testing period, and its five indices (precision, recall, accuracy, kappa, and F-score) are all higher than those of LSSVM. (2) The XGBoost-based approach provided the reliable flash flood risk maps, which were validated by another flash flood inventory, although some errors may be attributed to critical environmental factors and statistical disaster location accuracy. (3) The high-risk counties (including high-risk and highest-risk) accounted for 40.3%, with the highest-risk counties mainly concentrated in southeastern Yunnan. This article further addresses the limitations of XGBoost (e.g., as a time-consuming greedy algorithm, the non-necessity of multi-threaded optimization). All of the above results indicate that the XGBoost-based method is an effective method for obtaining high-quality county-level flash flood risk maps, which contributes to the theoretical basis for ongoing county-level flash flood prevention in China.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据