期刊
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS
卷 129, 期 -, 页码 21-32出版社
ELSEVIER
DOI: 10.1016/j.chemolab.2013.05.010
关键词
Multi-set component analysis; Missing data; Regularization; Imputation
Component analysis of data with missing values is often performed with algorithms of iterative imputation. However, this approach is prone to overfitting problems. As an alternative, Josse et al. (2009) proposed a regularized algorithm in the framework of Principal Component Analysis (PCA). Here we use a similar approach to deal with missing values in multi-level simultaneous component analysis (MLSCA), a method dedicated to explore multivariate multilevel data (e.g., individuals nested within groups). We discuss the properties of the regularized algorithm, the expected behavior under the missing (completely) at random (M(C)AR) mechanisms and possible dysmonotony problems. We explain the importance of separating the deviations due to sampling fluctuations and due to missing data. On the basis of a comparative extensive simulation study, we show that the regularized method generally performs well and clearly outperforms an EM-type of algorithm. (C) 2013 Elsevier B.V. All rights reserved.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据