4.4 Article

Parallel Analysis With Categorical Variables: Impact of Category Probability Proportions on Dimensionality Assessment Accuracy

Journal

PSYCHOLOGICAL METHODS
Volume 24, Issue 3, Pages 339-351

Publisher

AMER PSYCHOLOGICAL ASSOC
DOI: 10.1037/met0000171

Keywords

parallel analysis; dimensionality; exploratory factor analysis; principal component analysis; categorical variables

Ask authors/readers for more resources

Parallel analysis (PA) is regarded as one of the most accurate methods to determine the number of factors underlying a set of variables. Commonly, PA is performed on the basis of the variables' product-moment correlation matrix. To improve dimensionality assessments for dichotomous or ordered categorical variables, it has been proposed to replace product-moment correlations with more appropriate coefficients, such as tetrachoric or polychoric correlations. While similar modifications have proven useful for various factor analytic approaches, PA results were not consistently improved. The present article outlines a main reason for this result. Specifically, it explains the dependency of PA results on differing proportions of category probabilities when using tetrachoric or polychoric correlations and shows how to adjust for it by generating appropriate reference eigenvalues. The accuracy of dimensionality assessments of PA accounting for category probability proportions versus not accounting for them is investigated using simulation studies. The results show that the category probability adjusted approach distinctly improves dimensionality assessments. Translational Abstract Assessing the number of latent variables that sufficiently account for the joint variability among a set of observed variables has been an important research topic since Spearman (1904) proposed his factor analytic approach for measuring general intelligence. As a consequence, a variety of dimensionality assessment approaches have been proposed. Parallel analysis (PA) is considered as one of the most accurate methods. It compares the eigenvalues of a sample correlation matrix to the expected eigenvalues of independent variables. In the classical approach introduced by Horn (1965) all variables are assumed as metric. However, because categorical variables are prominent in psychology, it has been proposed to replace PA on the basis of product-moment correlations with PA based on tetrachoric or polychoric correlations, which are designed to account for the variables' ordinal level of measurement. Although this modification is theoretically reasonable, PA results were not consistently improved. The present article outlines a main reason for this outcome. Specifically, it explains the dependency of PA results on the distribution of category probabilities on the basis of the theoretical properties of the different correlation coefficients. Furthermore, it demonstrates which PA approach is most robust to unbalanced category probabilities using simulation studies. As the results show, PA based on tetrachoric/polychoric correlations is superior if, and only if, reference eigenvalues are generated based on variables matching the univariate category probabilities of the empirical sample. Program code for generating appropriate eigenvalues is provided.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available