4.6 Article

Comparative assessment of bivariate, multivariate and machine learning models for mapping flood proneness

期刊

NATURAL HAZARDS
卷 100, 期 2, 页码 461-491

出版社

SPRINGER
DOI: 10.1007/s11069-019-03821-y

关键词

Machine learning models; Bivariate models; Flood; Iraq

向作者/读者索取更多资源

This study applied two bivariate statistical models (frequency ratio and information value), one multivariate statistical model (logistic regression), and two supervised statistical learning models (boosted regression trees and classification and regression trees) for mapping flood proneness in an arid region of southern Iraq. For this purpose, ten flood causative factors were chosen based on data availability and local conditions along with the spatial extent of the large flood that affected the study area on 13 May 2013. The factors used involved topography-related factors (elevation, slope, curvature, topographic wetness index, and stream power index), lithology, soil, land use/land cover, the average of annual rainfall, and distance to rivers. The multicollinearity test proved that there was no multicollinearity problem among the factors used. Investigating the worth of factors in building the models using information gain ratio showed that the most important factors that play a major role in controlling flood proneness were elevation, followed by annual rainfall average, distance to rivers, land use/land cover, lithology, and soil. The models were employed using the most important factors to get flood proneness maps. The values of flood proneness were categorized into five classes using a quantile classification scheme. For validating the models, area under the receiver operating characteristic curve (AUC) was used. The AUC for prediction data set was 0.793, 0.786, 0.779, 0.754, and 0.753 for classification and regression trees, boosted regression trees, logistic regression, information value, and frequency ratio, respectively. For the best performance model (classification and regression trees), the areas occupied by flood proneness zones were 2735 km(2), 2809 km(2), 2816 km(2), 2732 km(2), and 2801 km(2), for very low, low, moderate, high, and very high flood proneness zones, respectively. The main conclusion is that the machine learning models are optimal in mapping flood proneness in the study area, followed by the multivariate and bivariate models. Decision makers and hydrologists for improved management of access floodwater and prevention of flood-related damages can adopt the flood proneness maps developed in this study.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据