4.7 Article

Balancing transferability and complexity of species distribution models for rare species conservation

Journal

DIVERSITY AND DISTRIBUTIONS
Volume 27, Issue 1, Pages 95-108

Publisher

WILEY
DOI: 10.1111/ddi.13174

Keywords

cross-validation; ENMeval; Maxent; over-parameterization; spatial prediction; statistical regularization

Funding

  1. United States Fish and Wildlife Service
  2. Idaho Department of Fish and Game

Ask authors/readers for more resources

The delineation of cross-validation folds has a substantial impact on model complexity and predictions. Different cross-validation approaches resulted in models with apparently high out-of-sample predictive ability, but optimal model complexity varied significantly. Careful consideration of cross-validation approaches is crucial for improving predictive accuracy and model transferability.
Aim Species distribution models (SDMs) are valuable for rare species conservation and are commonly used to extrapolate predictions of habitat suitability geographically to regions where species occurrence is unknown (i.e., transferability). Spatially structured cross-validation can be used to infer transferability, yet, few studies have evaluated how delineation of cross-validation folds affects model complexity and predictions. We developed SDMs using multiple cross-validation approaches to understand the implications for predicting habitat suitability for northern Idaho ground squirrels, a rare, federally threatened species that has been extensively surveyed in regions where known populations occur, resulting in >8000 presence locations. Location Idaho, USA. Methods We delineated cross-validation folds by mimicking the manner in which predictions would be geographically extrapolated or by using existing dispersal barriers. We varied the distance between, number, and directionality of folds. We conducted a grid search on statistical regularization parameters to optimize model complexity, covering a range of values exceeding that typically implemented. For each cross-validation approach, we selected optimal regularization and model complexity based on out-of-sample predictive ability. Results Delineation of cross-validation folds substantially affected resulting model complexity and extrapolated predictions. All cross-validation approaches resulted in models with apparently high out-of-sample predictive ability, yet optimal model complexity varied substantially among the approaches. Regularization demonstrated a noisy relationship between model complexity and prediction, where local optima in predictive performance were common at small values. Main conclusion Subtle modelling decisions can have large consequences for predictions of habitat suitability and transferability of SDMs. When transferability is the goal, cross-validation approaches should be considered carefully and mimic the manner in which spatial extrapolation will occur, else overly complex models with inflated assessments of predictive accuracy may result. Further, spatially structured cross-validation may not guard against over-parameterization, and assessing a broader range of regularization parameters may be necessary to optimize model complexity for transferability.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available