☆ 4.3 Article

Evaluating the Effects of Missing Data Handling Methods on Scale Linking Accuracy

EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT (2023)

期刊

EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT

卷 83, 期 6, 页码 1202-1228

出版社

SAGE PUBLICATIONS INC

DOI: 10.1177/00131644221140941

关键词

item response theory; scale linking; missing data; common item nonequivalent group design

类别

Psychology, Educational Mathematics, Interdisciplinary Applications Psychology, Mathematical

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This study evaluates the effects of six different approaches to handling missing data on the accuracy of scale linking based on item response theory. The results show that imputing with a response function, multiple imputation, and full information likelihood produce less errors in scale linking accuracy, while listwise deletion is associated with the most errors.

For large-scale assessments, data are often collected with missing responses. Despite the wide use of item response theory (IRT) in many testing programs, however, the existing literature offers little insight into the effectiveness of various approaches to handling missing responses in the context of scale linking. Scale linking is commonly used in large-scale assessments to maintain scale comparability over multiple forms of a test. Under a common-item nonequivalent group design (CINEG), missing data that occur to common items potentially influence the linking coefficients and, consequently, may affect scale comparability, test validity, and reliability. The objective of this study was to evaluate the effect of six missing data handling approaches, including listwise deletion (LWD), treating missing data as incorrect responses (IN), corrected item mean imputation (CM), imputing with a response function (RF), multiple imputation (MI), and full information likelihood information (FIML), on IRT scale linking accuracy when missing data occur to common items. Under a set of simulation conditions, the relative performance of the six missing data treatment methods under two missing mechanisms was explored. Results showed that RF, MI, and FIML produced less errors for conducting scale linking whereas LWD was associated with the most errors regardless of various testing conditions.

Evaluating the Effects of Missing Data Handling Methods on Scale Linking Accuracy

期刊

EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT

出版社

SAGE PUBLICATIONS INC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Evaluating the Effects of Missing Data Handling Methods on Scale Linking Accuracy

期刊

EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT

出版社

SAGE PUBLICATIONS INC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文