4.7 Article

Exploring the effects of the design and quantity of absence data on the performance of random forest-based landslide susceptibility mapping

Journal

CATENA
Volume 176, Issue -, Pages 45-64

Publisher

ELSEVIER
DOI: 10.1016/j.catena.2018.12.035

Keywords

Absence data; landslide susceptibility map; Random forest; Sampling method

Funding

  1. National Natural Science Foundation of China [41431177, 41601413]
  2. National Basic Research Program of China [2015CB954102]
  3. Natural Science Foundation of Jiangsu Province of China [BK20150975]
  4. PAPD
  5. Outstanding Innovation Team in Colleges and Universities in Jiangsu Province
  6. Vilas Associate Award
  7. Hammel Faculty Fellow Award
  8. Manasse Chair Professorship from the University of Wisconsin-Madison

Ask authors/readers for more resources

This study aims to explore the effects of the design and quantity of absence data on the performance of random forest-based landslide susceptibility mapping. First, the Youfang catchment was selected as the study area, since it has undergone landslides in recent years. Second, the Random, Bioclim, Improved Target Space Exteriorization Sampling (ITSES), one class SVM and domain methods were chosen to generate the absence data. Then, the size of absence data was set to 79, 158, 237, 316, 395, 790, 1580, 3160, 4740, 6320, and 7900, and we repeated the experiment 20 times to get reliable results. The random forest method was selected to construct the landslide susceptibility model. In addition, the model's accuracy and the continuous Boyce index were selected to evaluate the performance of the landslide susceptibility mapping under the various designs and different size of absence data. The results showed that the sampling range and size of the absence data demonstrated significant influences on the accuracy of the landslide susceptibility mapping. When the sampling area was reduced, the size of the absence data needed to be increased (1:5) to balance the presence and absence data in the environmental characteristic space. When the sample area was large (in the random method, it is 99%), the best ratio for the presence to absence data was 1:1. When the sample area was small, the best ratio for the presence to absence data was 1:100. Finally, the generated landslide susceptibility map may be useful for the local government and land use planners in the study area.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available