☆ 4.7 Article

Application and comparison of different ensemble learning machines combining with a novel sampling strategy for shallow landslide susceptibility mapping

STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT (2021)

Journal

STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT

Volume 35, Issue 6, Pages 1243-1256

Publisher

SPRINGER

DOI: 10.1007/s00477-020-01893-y

Keywords

Shallow landslide; Susceptibility; Ensemble learning; K-means clustering

Funding

National Natural Science Foundation of China [41972267, 41572257]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This study aims to evaluate and compare the performance of four models for landslide susceptibility modeling, with gradient boosting decision tree (GBDT) performing best in both training and validation datasets. The results indicate that GBDT is the most suitable model and can be further enhanced by combining with clustering analysis for improved sampling strategy of non-landslide points.

The existence of shallow landslide brings huge threats to the human lives and economic development, as the Lang County, Southeastern Tibet prone to landslide. Landslide susceptibility mapping (LSM) is considered as the key for the prevention of hazard. The primary goal of the present study is to assess and compare four models: classification and regression tree, gradient boosting decision tree (GBDT), adaptive boosting-decision tree and random forest for the performance of landslide susceptibility modeling. Firstly, a landslide inventory map consisting of 229 historical shallow landslide locations was prepared and the same number of non-landslide points was determined by k-means clustering. Secondly, 12 conditioning factors were considered in the landslide susceptibility modeling. The prediction performance of the four models were estimated by fivefold cross validation and relative operating characteristic curve (ROC), area under the ROC curve (AUC) and statistical measures. The results showed that the GBDT performed best in the training and validation dataset, with the highest prediction capability (AUC = 0.986 and 0.940), highest accuracy value (95.3% and 88.1%) and highest kappa index (0.904 and 0.772), respectively. Therefore, the GBDT was considered to be the most suitable model and applied to the whole study area for LSM. The results of this study also demonstrate that the performance can be enhanced with the use of ensemble learning. The sampling strategy of non-landslide points can be improved by combining with clustering analysis which are more reasonable.

Application and comparison of different ensemble learning machines combining with a novel sampling strategy for shallow landslide susceptibility mapping

Journal

STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT

Publisher

SPRINGER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Application and comparison of different ensemble learning machines combining with a novel sampling strategy for shallow landslide susceptibility mapping

Journal

STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT

Publisher

SPRINGER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper