4.7 Article

Missing values in multi-level simultaneous component analysis

Journal

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS
Volume 129, Issue -, Pages 21-32

Publisher

ELSEVIER
DOI: 10.1016/j.chemolab.2013.05.010

Keywords

Multi-set component analysis; Missing data; Regularization; Imputation

Ask authors/readers for more resources

Component analysis of data with missing values is often performed with algorithms of iterative imputation. However, this approach is prone to overfitting problems. As an alternative, Josse et al. (2009) proposed a regularized algorithm in the framework of Principal Component Analysis (PCA). Here we use a similar approach to deal with missing values in multi-level simultaneous component analysis (MLSCA), a method dedicated to explore multivariate multilevel data (e.g., individuals nested within groups). We discuss the properties of the regularized algorithm, the expected behavior under the missing (completely) at random (M(C)AR) mechanisms and possible dysmonotony problems. We explain the importance of separating the deviations due to sampling fluctuations and due to missing data. On the basis of a comparative extensive simulation study, we show that the regularized method generally performs well and clearly outperforms an EM-type of algorithm. (C) 2013 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available