Journal
BIOMETRICS
Volume 79, Issue 2, Pages 1268-1279
Publisher
WILEY
DOI: 10.1111/biom.13666
Keywords
missing at random; missing not at random; score test
Missing data can be divided into three categories: missing completely at random (MCAR), missing at random (MAR), and missing not at random (MNAR). Valid statistical approaches depend on correctly identifying the underlying missingness mechanism. Under a logistic model for the missing probability, this paper proposes two score tests, one under a parametric regression model and one under a semiparametric location model, to distinguish between the MAR and MNAR mechanisms. Simulations and an analysis of HIV data demonstrate the effectiveness of the score tests.
Missing data are frequently encountered in various disciplines and can be divided into three categories: missing completely at random (MCAR), missing at random (MAR), and missing not at random (MNAR). Valid statistical approaches to missing data depend crucially on correct identification of the underlying missingness mechanism. Although the problem of testing whether this mechanism is MCAR or MAR has been extensively studied, there has been very little research on testing MAR versus MNAR. A critical challenge in this problem is model identification under MNAR. In this paper, under a logistic model for the missing probability, we develop two score tests of whether the missingness mechanism is MAR or MNAR: one under a parametric model and one under a semiparametric location model for the regression function. Implementing the score tests circumvents the identification issue because it requires parameter estimation only under the null MAR assumption. Our simulations and analysis of human immunodeficiency virus data show that the score tests have well-controlled type I errors and desirable power.
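As an illustration of the testing problem described above, a logistic missingness model and the corresponding hypotheses might be written as follows. The notation is an assumption made here for exposition and is not taken from the abstract: Y denotes the response subject to missingness, X the fully observed covariates, D the indicator that Y is observed, and (\alpha, \beta, \gamma) hypothetical parameter names.

```latex
\[
  \Pr(D = 1 \mid X = x,\, Y = y)
  = \frac{\exp(\alpha + \beta^{\top} x + \gamma y)}
         {1 + \exp(\alpha + \beta^{\top} x + \gamma y)},
\]
\[
  H_0 : \gamma = 0 \ \ (\text{MAR})
  \qquad \text{versus} \qquad
  H_1 : \gamma \neq 0 \ \ (\text{MNAR}).
\]
```

In this sketch, setting \gamma = 0 makes the missing probability depend on x alone, which is why a score test needs estimates only under the null MAR model and so avoids fitting the MNAR model, where identification is problematic.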