4.6 Article

Combining multiple data sources in species distribution models while accounting for spatial dependence and overfitting with combined penalized likelihood maximization

期刊

METHODS IN ECOLOGY AND EVOLUTION
卷 10, 期 12, 页码 2118-2128

出版社

WILEY
DOI: 10.1111/2041-210X.13297

关键词

area-interaction models; combined likelihood framework; diagnostic tools; lasso-type penalties; occupancy models; point process models; presence-only data; species distribution models

类别

向作者/读者索取更多资源

The increase in availability of species datasets means that approaches to species distribution modelling that incorporate multiple datasets are in greater demand. Recent methodological developments in this area have led to combined likelihood approaches, in which a log-likelihood comprised of the sum of the log-likelihood components of each data source is maximized. Often, these approaches make use of at least one presence-only dataset and use the log-likelihood of an inhomogeneous Poisson point process model in the combined likelihood construction. While these advancements have been shown to improve predictive performance, they do not currently address challenges in presence-only modelling such as checking and correcting for violations of the independence assumption of a Poisson point process model or more general challenges in species distribution modelling such as overfitting. In this paper, we present an extension of the combined likelihood framework which accommodates alternative presence-only likelihoods in the presence of spatial dependence as well as lasso-type penalties to account for potential overfitting. We compare the proposed combined penalized likelihood approach to the standard combined likelihood approach via simulation and apply the method to modelling the distribution of the Eurasian lynx in the Jura Mountains in eastern France. The simulations show that the proposed combined penalized likelihood approach has better predictive performance than the standard approach when spatial dependence is present in the data. The lynx analysis shows that the predicted maps vary significantly between the model fitted with the proposed combined penalized approach accounting for spatial dependence and the model fitted with the standard combined likelihood. This work highlights the benefits of careful consideration of the presence-only components of the combined likelihood formulation, and allows greater flexibility and ability to accommodate real datasets.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据