4.7 Article

On evaluating species distribution models with random background sites in place of absences when test presences disproportionately sample suitable habitat

期刊

DIVERSITY AND DISTRIBUTIONS
卷 19, 期 7, 页码 867-872

出版社

WILEY
DOI: 10.1111/ddi.12031

关键词

AUC; background sites; biased data; model evaluation; species distribution models

资金

  1. US Institute of Museum and Library Services

向作者/读者索取更多资源

Modelling the distribution of rare and invasive species often occurs in situations where reliable absences for evaluating model performance are unavailable. However, predictions at randomly located sites, or background' sites, can stand in for true absences. The maximum value of the area under the receiver operator characteristic curve, AUC, calculated with background sites is believed to be 1-a/2, where a is the typically unknown prevalence of the species on the landscape. Using a simple example of a species' range, I show how AUC can achieve values >1-a/2 when test presences do not represent each inhabited region of a species__ range in proportion to its area. Values of AUC that surpass 1-a/2 are associated with higher model predictions in areas overrepresented in the test data set, even if they are less environmentally suitable than other regions the species occupies. Pursuit of high AUC values can encourage inclusion of spurious predictors in the final model if they help to differentiate areas with disproportionate representation in the test data. Choices made during modelling to increase AUC calculated with background sites on the assumption that higher scores connote more accurate models can decrease actual accuracy when test presences disproportionately represent inhabited areas.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据