4.7 Article

Rapid Landslide Extraction from High-Resolution Remote Sensing Images Using SHAP-OPT-XGBoost

Journal

REMOTE SENSING
Volume 15, Issue 15, Pages -

Publisher

MDPI
DOI: 10.3390/rs15153901

Keywords

landslide extraction; XGBoost; high-resolution remote sensing; SHAP; Optuna

Ask authors/readers for more resources

This study integrates Shapley Additive Explanation (SHAP) and Optuna (OPT) hyperparameter tuning into four basic machine learning algorithms and applies them to landslide extraction in Fengjie County, Chongqing, China. The experimental results show that the four SHAP-OPT models have an accuracy above 92% and a training time less than 1.3 seconds. Among them, SHAP-OPT-XGBoost achieves the highest accuracy (96.26%) and can extract landslide distribution information accurately and quickly.
Landslides, the second largest geological hazard after earthquakes, result in significant loss of life and property. Extracting landslide information quickly and accurately is the basis of landslide disaster prevention. Fengjie County, Chongqing, China, is a typical landslide-prone area in the Three Gorges Reservoir Area. In this study, we newly integrate Shapley Additive Explanation (SHAP) and Optuna (OPT) hyperparameter tuning into four basic machine learning algorithms: Gradient Boosting Decision Tree (GBDT), Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), and Additive Boosting (AdaBoost). We construct four new models (SHAP-OPT-GBDT, SHAP-OPT-XGBoost, SHAP-OPT-LightGBM, and SHAP-OPT-AdaBoost) and apply the four new models to landslide extraction for the first time. Firstly, high-resolution remote sensing images were preprocessed, landslide and non-landslide samples were constructed, and an initial feature set with 48 features was built. Secondly, SHAP was used to select features with significant contributions, and the important features were selected. Finally, Optuna, the Bayesian optimization technique, was utilized to automatically select the basic models' best hyperparameters. The experimental results show that the accuracy (ACC) of these four SHAP-OPT models was above 92% and the training time was less than 1.3 s using mediocre computational hardware. Furthermore, SHAP-OPT-XGBoost achieved the highest accuracy (96.26%). Landslide distribution information in Fengjie County from 2013 to 2020 can be extracted by SHAP-OPT-XGBoost accurately and quickly.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available