4.5 Review

Avoiding Blunders When Analyzing Correlated Data, Clustered Data, or Repeated Measures

期刊

JOURNAL OF RHEUMATOLOGY
卷 50, 期 10, 页码 1269-1272

出版社

J RHEUMATOL PUBL CO
DOI: 10.3899/jrheum.2022-1109

关键词

clustered data; correlated data; mixed model; repeated measures; statistical software

向作者/读者索取更多资源

Rheumatology research often involves correlated and clustered data. Treating these data as independent observations can lead to incorrect statistical inference. In this study, data from 633 rheumatoid arthritis patients were analyzed using generalized linear models, while adjusting for factors such as rheumatoid factor positivity and sex, as well as considering additional correlation using generalized linear mixed models and generalized estimating equations. The study found that when correlation was accounted for, standard errors increased, resulting in overestimated effect size, narrower confidence intervals, increased type I error, and potentially misleading results.
Rheumatology research often involves correlated and clustered data. A common error when analyzing these data occurs when instead we treat these data as independent observations. This can lead to incorrect statistical inference. The data used are a subset of the 2017 study from Raheel et al consisting of 633 patients with rheumatoid arthritis (RA) between 1988 and 2007. RA flare and the number of swollen joints served as our binary and continuous outcomes, respectively. Generalized linear models (GLM) were fitted for each, while adjusting for rheumatoid factor (RF) positivity and sex. Additionally, a generalized linear mixed model with a random intercept and a generalized estimating equation were used to model RA flare and the number of swollen joints, respectively, to take additional correlation into account. The GLM's beta coefficients and their 95% confidence intervals (CIs) are then compared to their mixed-effects equivalents. The beta coefficients compared between methodologies are very similar. However, their standard errors increase when correlation is accounted for. As a result, if the additional correlations are not considered, the standard error can be underestimated. This results in an overestimated effect size, narrower CIs, increased type I error, and a smaller P value, thus potentially producing misleading results. It is important to model the additional correlation that occurs in correlated data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据