4.7 Article

Use of the p-values as a size-dependent function to address practical differences when analyzing large datasets

期刊

SCIENTIFIC REPORTS
卷 11, 期 1, 页码 -

出版社

NATURE PORTFOLIO
DOI: 10.1038/s41598-021-00199-5

关键词

-

资金

  1. Ministerio de Ciencia, Innovacion y Universidades, Agencia Estatal de Investigacion [TEC2015-73064-EXP, TEC2016-78052, PID2019-109820RB-I00, MCIN/AEI/10.13039/501100011033]
  2. European Regional Development Fund (ERDF)
  3. BBVA Foundation
  4. Leonardo Grant for Researchers and Cultural Creators (AMB)
  5. US National Institutes of Health [UO1AG060903, P30AG021334, U54CA143868]
  6. National Science Foundation Graduate Research Fellowship [1746891]
  7. NVIDIA Corporation

向作者/读者索取更多资源

This study examines the utilization of p-values in biomedical research, proposing a new approach to address the size effect in p-value interpretation for datasets with large sample sizes. The introduction of new descriptive parameters aims to reduce uncertainty in determining the existence of biological differences between compared experiments.
Biomedical research has come to rely on p-values as a deterministic measure for data-driven decision-making. In the largely extended null hypothesis significance testing for identifying statistically significant differences among groups of observations, a single p-value is computed from sample data. Then, it is routinely compared with a threshold, commonly set to 0.05, to assess the evidence against the hypothesis of having non-significant differences among groups, or the null hypothesis. Because the estimated p-value tends to decrease when the sample size is increased, applying this methodology to datasets with large sample sizes results in the rejection of the null hypothesis, making it not meaningful in this specific situation. We propose a new approach to detect differences based on the dependence of the p-value on the sample size. We introduce new descriptive parameters that overcome the effect of the size in the p-value interpretation in the framework of datasets with large sample sizes, reducing the uncertainty in the decision about the existence of biological differences between the compared experiments. The methodology enables the graphical and quantitative characterization of the differences between the compared experiments guiding the researchers in the decision process. An in-depth study of the methodology is carried out on simulated and experimental data. Code availability at https://github.com/BIIG-.UC3M/pMoSS.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据