Journal
SOCIOLOGICAL METHODS & RESEARCH
Volume 47, Issue 3, Pages 507-531
Publisher
SAGE PUBLICATIONS INC
DOI: 10.1177/0049124116638107
Keywords
pseudo-R²; logistic regression; goodness-of-fit; benchmarks; reporting
The literature proposes numerous so-called pseudo-R² measures for evaluating goodness of fit in regression models with categorical dependent variables. Unlike the ordinary least squares R², log-likelihood-based pseudo-R²s do not represent the proportion of explained variance but rather the improvement in model likelihood over a null model. The multitude of available pseudo-R² measures and the absence of benchmarks often lead to confusing interpretations and unclear reporting. Drawing on a meta-analysis of 274 published logistic regression models as well as simulated data, this study investigates fundamental differences between distinct pseudo-R² measures, focusing on their dependence on basic study design characteristics. Results indicate that almost all pseudo-R²s are influenced to some extent by sample size, number of predictor variables, and number of categories of the dependent variable and its distribution asymmetry. Hence, an interpretation by goodness-of-fit benchmark values must explicitly consider these characteristics. The authors derive a set of goodness-of-fit benchmark values with respect to ranges of sample size and distribution of observations for this measure. This study raises awareness of fundamental differences in characteristics of pseudo-R²s and the need for greater precision in reporting these measures.
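To make the phrase "improvement in model likelihood over a null model" concrete, the following is a minimal sketch of three commonly used log-likelihood-based pseudo-R² measures (McFadden, Cox and Snell, Nagelkerke). The abstract does not say which measures the study covers, so this selection is an assumption, and the function names and the numbers in the example are hypothetical illustrations, not values from the paper.

```python
import math


def null_log_likelihood_binary(n_events: int, n_total: int) -> float:
    """Log-likelihood of an intercept-only (null) model for a binary outcome."""
    if not 0 < n_events < n_total:
        raise ValueError("need at least one event and one non-event")
    p = n_events / n_total  # the null model predicts the base rate for everyone
    return n_events * math.log(p) + (n_total - n_events) * math.log(1 - p)


def mcfadden_r2(ll_null: float, ll_model: float) -> float:
    """McFadden: relative reduction in log-likelihood versus the null model."""
    return 1.0 - ll_model / ll_null


def cox_snell_r2(ll_null: float, ll_model: float, n: int) -> float:
    """Cox and Snell: based on the likelihood ratio, scaled by sample size n."""
    return 1.0 - math.exp(2.0 * (ll_null - ll_model) / n)


def nagelkerke_r2(ll_null: float, ll_model: float, n: int) -> float:
    """Nagelkerke: Cox and Snell rescaled by its maximum attainable value."""
    max_cox_snell = 1.0 - math.exp(2.0 * ll_null / n)
    return cox_snell_r2(ll_null, ll_model, n) / max_cox_snell


# Hypothetical example: 200 observations, 60 events, and an assumed
# fitted-model log-likelihood of -100.0 (not taken from the paper).
n_obs = 200
ll0 = null_log_likelihood_binary(60, n_obs)
ll1 = -100.0
print("McFadden:   ", round(mcfadden_r2(ll0, ll1), 4))
print("Cox & Snell:", round(cox_snell_r2(ll0, ll1, n_obs), 4))
print("Nagelkerke: ", round(nagelkerke_r2(ll0, ll1, n_obs), 4))
```

The same pair of log-likelihoods yields three different values, and two of the formulas depend explicitly on the sample size, which is one reason a reported pseudo-R² is hard to interpret unless the measure is named.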