4.7 Article

Comprehensive evaluation of harmonization on functional brain imaging for multisite data-fusion

期刊

NEUROIMAGE
卷 274, 期 -, 页码 -

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.neuroimage.2023.120089

关键词

Comparison; Harmonization; Multi-site pooling; Resting-state fMRI

向作者/读者索取更多资源

To address the challenge of harmonizing the site effect in big-data neuroimaging, this study comprehensively evaluated various strategies for resting-state functional magnetic resonance imaging (R-fMRI) data fusion. The evaluation included tests on residual site effect, individual identification, test-retest reliability, and replicability of group-level statistical results for widely used R-fMRI metrics across different datasets. The results showed that the Subsampling Maximum-mean-distance based distribution shift correction Algorithm (SMA) and parametric unadjusted CovBat outperformed other methods in terms of individual identification and test-retest reliability. SMA also exhibited better replicability and robustness in detecting sex differences. Additionally, the study provided practical guidelines for optimizing SMA and suggested the importance of sample size and target site distribution. Overall, this work contributes to the improvement and innovation of harmonizing methodologies for big R-fMRI data.
To embrace big-data neuroimaging, harmonizing the site effect in resting-state functional magnetic resonance imaging (R-fMRI) data fusion is a fundamental challenge. A comprehensive evaluation of potentially effective harmonization strategies, particularly with specifically collected data, has been scarce, especially for R-fMRI metrics. Here, we comprehensively assess harmonization strategies from multiple perspectives, including tests on residual site effect, individual identification, test-retest reliability, and replicability of group-level statisti-cal results, on widely used R-fMRI metrics across various datasets, including data obtained from participants with repetitive measures at different scanners. For individual identifiability (i.e., whether the same subject could be identified across R-fMRI data scanned across different sites), we found that, while most methods decreased site effects, the Subsampling Maximum-mean-distance based distribution shift correction Algorithm (SMA) and parametric unadjusted CovBat outperformed linear regression models, linear mixed models, ComBat series and invariant conditional variational auto-encoder in clustering accuracy. Test-retest reliability was better for SMA and parametric adjusted CovBat than unadjusted ComBat series and parametric unadjusted CovBat in the number of overlapped voxels. At the same time, SMA was superior to the latter in replicability in terms of the Dice coef-ficient and the scale of brain areas showing sex differences reproducibly observed across datasets. Furthermore, SMA better detected reproducible sex differences of ALFF under the site-sex confounded situation. Moreover, we designed experiments to identify the best target site features to optimize SMA identifiability, test-retest reliabil-ity, and stability. We noted both sample size and distribution of the target site matter and introduced a heuristic formula for selecting the target site. In addition to providing practical guidelines, this work can inform continuing improvements and innovations in harmonizing methodologies for big R-fMRI data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据