4.7 Article

A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps

期刊

ECOLOGICAL INFORMATICS
卷 69, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.ecoinf.2021.101501

关键词

Bird species; Data pooling; Model-based data integration; Presence-only data; Random forest; Species distribution models; Swiss Alps

类别

资金

  1. University of Lausanne
  2. Iran's Ministry of Science, Research and Technology

向作者/读者索取更多资源

Accurately modeling species distributions and biodiversity is crucial for ecological and conservation challenges. Data on species distributions and abundance are essential for decision-making on conservation priority. Integrating data from multiple sources using model-based data integration approaches can improve the predictive accuracy of species distribution models. Machine learning algorithms, particularly tree-dependent ensembles like Random Forest, are the most accurate techniques for predicting bird species distribution.
It is essential to accurately model species distributions and biodiversity in response to many ecological and conservation challenges. The primary means of reliable decision-making on conservation priority are the data on the distributions and abundance of species. However, finding data that is accurate and reliable for predicting species distribution could be challenging. Data could come from different sources, with different designs, coverage, and potential sampling biases. In this study, we examined the emerging methods of modelling species distribution that integrate data from multiple sources such as systematic or standardized and casual or occasional surveys. We applied two modelling approaches, data-pooling and model-based data integration that each involves combining various datasets to measure environmental interactions and clarify the distribution of spe-cies. Our paper demonstrates a reliable data integration workflow that includes gathering information on model -based data integration, creating a sub-model of each dataset independently, and finally, combining it into a single final model. We have shown that this is a more reliable way of developing a model than a data pooling strategy that combines multiple data sources to fit a single model. Moreover, data integration approaches could improve the poor predictive performance of systematic small datasets, through model-based data integration techniques that enhance the predictive accuracy of Species Distribution Models. We also identified, consistent with previous research, that machine learning algorithms are the most accurate techniques to predict bird species distribution in our heterogeneous study area in the western Swiss Alps. In particular, tree-dependent ensembles of Random Forest (RF) contribute to a better understanding of the interactions between species and the environment.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据