4.7 Article

Landslide susceptibility zonation method based on C5.0 decision tree and K-means cluster algorithms to improve the efficiency of risk management

期刊

GEOSCIENCE FRONTIERS
卷 12, 期 6, 页码 -

出版社

CHINA UNIV GEOSCIENCES, BEIJING
DOI: 10.1016/j.gsf.2021.101249

关键词

Landslide susceptibility; Frequency ratio; C5.0 decision tree; K-means cluster; Classification; Risk management

资金

  1. National Natural Science Foundation of China [41807285, 51679117]
  2. Key Project of the State Key Laboratory of Geohazard Prevention and Geoenvironment Protection [SKLGP2019Z002]
  3. National Science Foundation of Jiangxi Province, China [20192BAB216034]
  4. China Postdoctoral Science Foundation [2019M652287, 2020T130274]
  5. Jiangxi Provincial Postdoctoral Science Foundation [2019KY08]
  6. Fundamental Research Funds for National Universities, China University of Geosciences (Wuhan)

向作者/读者索取更多资源

This study introduced a machine learning approach based on the C5.0 decision tree model and the K-means cluster algorithm to produce a regional landslide susceptibility map, which outperformed traditional models in terms of model performance according to the validation results.
Machine learning algorithms are an important measure with which to perform landslide susceptibility assessments, but most studies use GIS-based classification methods to conduct susceptibility zonation. This study presents a machine learning approach based on the C5.0 decision tree (DT) model and the K-means cluster algorithm to produce a regional landslide susceptibility map. Yanchang County, a typical landslide-prone area located in northwestern China, was taken as the area of interest to introduce the proposed application procedure. A landslide inventory containing 82 landslides was prepared and subsequently randomly partitioned into two subsets: training data (70% landslide pixels) and validation data (30% landslide pixels). Fourteen landslide influencing factors were considered in the input dataset and were used to calculate the landslide occurrence probability based on the C5.0 decision tree model. Susceptibility zonation was implemented according to the cut-off values calculated by the K-means cluster algorithm. The validation results of the model performance analysis showed that the AUC (area under the receiver operating characteristic (ROC) curve) of the proposed model was the highest, reaching 0.88, compared with traditional models (support vector machine (SVM) = 0.85, Bayesian network (BN) = 0.81, frequency ratio (FR) = 0.75, weight of evidence (WOE) = 0.76). The landslide frequency ratio and frequency density of the high susceptibility zones were 6.76/km(2) and 0.88/km(2), respectively, which were much higher than those of the low susceptibility zones. The top 20% interval of landslide occurrence probability contained 89% of the historical landslides but only accounted for 10.3% of the total area. Our results indicate that the distribution of high susceptibility zones was more focused without containing more stable pixels. Therefore, the obtained susceptibility map is suitable for application to landslide risk management practices. (C) 2021 China University of Geosciences (Beijing) and Peking University. Production and hosting by Elsevier B.V.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据