4.7 Article

Reproducibility of R-fMRI metrics on the impact of different strategies for multiple comparison correction and sample sizes

期刊

HUMAN BRAIN MAPPING
卷 39, 期 1, 页码 300-318

出版社

WILEY
DOI: 10.1002/hbm.23843

关键词

multiple comparison correction strategies; positive predictive value; replicability; reproducibility; resting-state fMRI; sample size; sensitivity; test-retest reliability

资金

  1. National Key R&D Program of China [2017YFC1309902]
  2. National Natural Science Foundation of China [81671774, 81630031]
  3. Hundred Talents Program of the Chinese Academy of Sciences
  4. Beijing Municipal Science & Technology Commission [Z161100000216152]

向作者/读者索取更多资源

Concerns regarding reproducibility of resting-state functional magnetic resonance imaging (R-fMRI) findings have been raised. Little is known about how to operationally define R-fMRI reproducibility and to what extent it is affected by multiple comparison correction strategies and sample size. We comprehensively assessed two aspects of reproducibility, test-retest reliability and replicability, on widely used R-fMRI metrics in both between-subject contrasts of sex differences and within-subject comparisons of eyes-open and eyes-closed (EOEC) conditions. We noted permutation test with Threshold-Free Cluster Enhancement (TFCE), a strict multiple comparison correction strategy, reached the best balance between family-wise error rate (under 5%) and test-retest reliability/replicability (e.g., 0.68 for test-retest reliability and 0.25 for replicability of amplitude of low-frequency fluctuations (ALFF) for between-subject sex differences, 0.49 for replicability of ALFF for within-subject EOEC differences). Although R-fMRI indices attained moderate reliabilities, they replicated poorly in distinct datasets (replicability<0.3 for between-subject sex differences,<0.5 for within-subject EOEC differences). By randomly drawing different sample sizes from a single site, we found reliability, sensitivity and positive predictive value (PPV) rose as sample size increased. Small sample sizes (e.g.,<80 [40 per group]) not only minimized power (sensitivity<2%), but also decreased the likelihood that significant results reflect true effects (PPV<0.26) in sex differences. Our findings have implications for how to select multiple comparison correction strategies and highlight the importance of sufficiently large sample sizes in R-fMRI studies to enhance reproducibility. Hum Brain Mapp 39:300-318, 2018. (c) 2017 Wiley Periodicals, Inc.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据