4.7 Article

Trial and error: A hierarchical modeling approach to test-retest reliability

Journal

NEUROIMAGE
Volume 245, Issue -, Pages -

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.neuroimage.2021.118647

Keywords

-

Funding

  1. NIMH
  2. NINDS Intramural Research Programs of the NIH/HHS, USA [ZICMH002888]
  3. NIMH Intramural Research Program [ZIAMH002781]

Ask authors/readers for more resources

This article discusses the issue of reliability in neuroimaging studies using the test-retest method, highlighting the risk of low reliability in common cognitive tasks. It proposes a hierarchical framework to evaluate reliability, suggesting that traditional intraclass correlation statistics may underestimate reliability and more trials are needed for accurate reliability estimates.
The concept of test-retest reliability indexes the consistency of a measurement across time. High reliability is critical for any scientific study, but specifically for the study of individual differences. Evidence of poor reliability of commonly used behavioral and functional neuroimaging tasks is mounting. Reports on low reliability of task-based fMRI have called into question the adequacy of using even the most common, well-characterized cognitive tasks with robust population-level effects, to measure individual differences. Here, we lay out a hierarchical framework that estimates reliability as a correlation divorced from trial-level variability, and show that reliability tends to be underestimated under the conventional intraclass correlation framework through summary statistics based on condition-level modeling. In addition, we examine how reliability estimation between the two statistical frameworks diverges and assess how different factors (e.g., trial and subject sample sizes, relative magnitude of cross-trial variability) impact reliability estimates. As empirical data indicate that cross-trial variability is large in most tasks, this work highlights that a large number of trials (e.g., greater than 100) may be required to achieve precise reliability estimates. We reference the tools TRR and 3dLMEr for the community to apply trial-level models to behavior and neuroimaging data and discuss how to make these new measurements most useful for future studies.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available