4.2 Article

Comparison of four heterogeneity measures for meta-analysis

期刊

JOURNAL OF EVALUATION IN CLINICAL PRACTICE
卷 26, 期 1, 页码 376-384

出版社

WILEY
DOI: 10.1111/jep.13159

关键词

heterogeneity; I-2 statistic; meta-analysis; statistical power

向作者/读者索取更多资源

Rationale, aims, and objectives: Heterogeneity is a critical issue in meta-analysis, because it implies the appropriateness of combining the collected studies and impacts the reliability of the synthesized results. The Q test is a traditional method to assess heterogeneity; however, because it does not have an intuitive interpretation for clinicians and often has low statistical power, many meta-analysts alter to use some measures, such as the I-2 statistic, to quantify the extent of heterogeneity. This article aims at providing a summary of available tools to assess heterogeneity and comparing their performance. Methods: We reviewed four heterogeneity measures (I-2, (R) over cap (I), (R) over cap (M), and (R) over cap (b)) and illustrated how they could be treated as test statistics like the Q statistic. These measures were compared with respect to statistical power based on simulations driven by three real-data examples. The pairwise agreement among the four measures was also evaluated using Cohen's. coefficient. Results: Generally, (R) over cap (I) was slightly more powerful than the Q test, while its type I error rate might be slightly inflated. The power of I-2 was fairly close to that of Q. The (R) over cap (M) and (R) over cap (b) statistics might have low powers in some cases. Because the differences between the powers of I-2, (R) over cap (I), and Q were often tiny, meta-analysts might not expect I-2 and (R) over cap (I) to yield significant heterogeneity if the Q test failed to do so. In addition, I-2 and (R) over cap (I) had fairly good agreement based on the simulated meta-analyses, but all other pairs of heterogeneity measures generally had poor agreement. Conclusion: The I-2 and (R) over cap (I) statistics are recommended for measuring heterogeneity. Meta-analysts should use the heterogeneity measures as descriptive statistics which have intuitive interpretations from the clinical perspective, instead of determining the significance of heterogeneity simply based on their magnitudes.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据