4.5 Article

Ecologists should not use statistical significance tests to interpret simulation model results

期刊

OIKOS
卷 123, 期 4, 页码 385-388

出版社

WILEY-BLACKWELL
DOI: 10.1111/j.1600-0706.2013.01073.x

关键词

-

类别

资金

  1. Florida Sea Grant [PD/12-18]
  2. Killam Fellowship

向作者/读者索取更多资源

Simulation models are widely used to represent the dynamics of ecological systems. A common question with such models is how changes to a parameter value or functional form in the model alter the results. Some authors have chosen to answer that question using frequentist statistical hypothesis tests (e.g. ANOVA). This is inappropriate for two reasons. First, p-values are determined by statistical power (i.e. replication), which can be arbitrarily high in a simulation context, producing minuscule p-values regardless of the effect size. Second, the null hypothesis of no difference between treatments (e.g. parameter values) is known a priori to be false, invalidating the premise of the test. Use of p-values is troublesome (rather than simply irrelevant) because small p-values lend a false sense of importance to observed differences. We argue that modelers should abandon this practice and focus on evaluating the magnitude of differences between simulations. Synthesis Researchers analyzing field or lab data often test ecological hypotheses using frequentist statistics (t-tests, ANOVA, etc.) that focus on p-values. Field and lab data usually have limited sample sizes, and p-values are valuable for quantifying the probability of making incorrect inferences in that situation. However, modern ecologists increasingly rely on simulation models to address complex questions, and those who were trained in frequentist statistics often apply the hypothesis-testing approach inappropriately to their simulation results. Our paper explains why p-values are not informative for interpreting simulation models, and suggests better ways to evaluate the ecological significance of model results.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据