4.7 Article

Comparison of regression-based and machine learning techniques to explain alpha diversity of fish communities in streams of central and eastern India

Journal

ECOLOGICAL INDICATORS
Volume 129, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.ecolind.2021.107922

Keywords

Freshwater fish; Artificial neural network; Linear mixed models; Multivariate adaptive regression splines; Generalized additive models

Funding

  1. Ministry of Environment, Forests and Climate Change (MoEFCC) [14/87/2014ERS/RE]
  2. Indian Institute of Science Education and Research (IISER) Kolkata
  3. Council of Scientific and Industrial Research, India (CSIR India)


Ecologists have long developed models to describe species-habitat relationships within ecological communities, with a focus on the complex interactions between communities and ecological factors. In a study of freshwater fish in eastern and central India, several modeling methods were used to predict species richness and diversity, and conductivity, water temperature, and water velocity were identified as the most important factors. An artificial neural network gave the best fit for capturing nonlinear relationships between variables, although its predictive power was on par with the other methods. The study also highlights the importance of variable selection in predictive modeling.
Over the past several decades, ecologists have been striving to develop models that accurately describe species-habitat relationships across ecological communities. Statistical models that explain ecological dynamics need to consider the nuances of the complex interactions between communities and ecological factors. Here, we used multiple linear mixed models (LMM), generalized additive models (GAM), multivariate adaptive regression splines (MARS), and artificial neural networks (ANN) to model species richness and diversity of freshwater fishes in eastern and central India. The models were based on fish abundance and associated ecological data collected over three years across the study regions. We developed global models using all predictors after removing highly correlated variables (Pearson's r > 0.7). Results revealed conductivity, water temperature, and water velocity as the most important predictive factors of both species richness and diversity. We then built two subsets of selected factors for predictive models of diversity and richness: subset 1 contained the significant factors common to the four modeling methods, and subset 2 was chosen with an automatic feature selection technique. Among the modeling methods used in our study, ANN was found to create the best-fit models for explaining nonlinearities between response variables and predictors. The importance of variable selection is highlighted, given that subset 1 (common consensual factors) creates more homogeneity in predictions than subset 2 (automated feature selection). Contrary to similar studies in recent years, which show machine learning (ML) methods typically outperforming conventional methods, our results revealed that ANN performed on par with other methods in terms of predictive power. Our findings underline the need for a judicious choice of modeling techniques based on the availability of the data and the ecological communities being studied.
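
The correlation filter mentioned in the abstract (dropping predictors with Pearson's r > 0.7) can be illustrated with a minimal sketch. The abstract does not say how the authors chose which variable of a correlated pair to keep, or whether the threshold was applied to the absolute value of r. The greedy "first variable wins" rule, the use of |r|, the function names, and the toy values below are all assumptions made for illustration, not the study's procedure or data.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def drop_correlated(predictors, threshold=0.7):
    """Greedily keep predictors whose |r| with every kept predictor is <= threshold.

    `predictors` maps a variable name to its list of observations.
    Variables are visited in insertion order, so an earlier variable
    is kept in preference to a later one it is correlated with
    (an assumed tie-breaking rule).
    """
    kept = []
    for name, values in predictors.items():
        if all(abs(pearson_r(values, predictors[k])) <= threshold for k in kept):
            kept.append(name)
    return kept


# Made-up illustrative values, NOT data from the study.
toy = {
    "conductivity": [120.0, 150.0, 180.0, 210.0, 260.0],
    "total_dissolved_solids": [80.0, 99.0, 121.0, 140.0, 172.0],  # tracks conductivity
    "water_velocity": [0.5, 0.2, 0.9, 0.3, 0.6],
}
selected = drop_correlated(toy)
print(selected)
```

In this toy set the second variable is almost a linear function of the first, so it is dropped, while the weakly correlated third variable is retained. In practice the order in which variables are visited would usually be guided by ecological relevance rather than by insertion order.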
