☆ 4.2 Article

Comparison of four heterogeneity measures for meta-analysis

JOURNAL OF EVALUATION IN CLINICAL PRACTICE (2020)

期刊

JOURNAL OF EVALUATION IN CLINICAL PRACTICE

卷 26, 期 1, 页码 376-384

出版社

WILEY

DOI: 10.1111/jep.13159

关键词

heterogeneity; I-2 statistic; meta-analysis; statistical power

类别

Health Care Sciences & Services Medical Informatics Medicine, General & Internal

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Rationale, aims, and objectives: Heterogeneity is a critical issue in meta-analysis, because it implies the appropriateness of combining the collected studies and impacts the reliability of the synthesized results. The Q test is a traditional method to assess heterogeneity; however, because it does not have an intuitive interpretation for clinicians and often has low statistical power, many meta-analysts alter to use some measures, such as the I-2 statistic, to quantify the extent of heterogeneity. This article aims at providing a summary of available tools to assess heterogeneity and comparing their performance. Methods: We reviewed four heterogeneity measures (I-2, (R) over cap (I), (R) over cap (M), and (R) over cap (b)) and illustrated how they could be treated as test statistics like the Q statistic. These measures were compared with respect to statistical power based on simulations driven by three real-data examples. The pairwise agreement among the four measures was also evaluated using Cohen's. coefficient. Results: Generally, (R) over cap (I) was slightly more powerful than the Q test, while its type I error rate might be slightly inflated. The power of I-2 was fairly close to that of Q. The (R) over cap (M) and (R) over cap (b) statistics might have low powers in some cases. Because the differences between the powers of I-2, (R) over cap (I), and Q were often tiny, meta-analysts might not expect I-2 and (R) over cap (I) to yield significant heterogeneity if the Q test failed to do so. In addition, I-2 and (R) over cap (I) had fairly good agreement based on the simulated meta-analyses, but all other pairs of heterogeneity measures generally had poor agreement. Conclusion: The I-2 and (R) over cap (I) statistics are recommended for measuring heterogeneity. Meta-analysts should use the heterogeneity measures as descriptive statistics which have intuitive interpretations from the clinical perspective, instead of determining the significance of heterogeneity simply based on their magnitudes.

Comparison of four heterogeneity measures for meta-analysis

期刊

JOURNAL OF EVALUATION IN CLINICAL PRACTICE

出版社

WILEY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Comparison of four heterogeneity measures for meta-analysis

期刊

JOURNAL OF EVALUATION IN CLINICAL PRACTICE

出版社

WILEY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文