4.7 Article

Selection of non-zero loadings in sparse principal component analysis

Journal

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS
Volume 162, Issue -, Pages 160-171

Publisher

ELSEVIER
DOI: 10.1016/j.chemolab.2017.01.018

Keywords

Sparse Principal Component Analysis (SPCA); Principal Component Analysis (PCA); Genetic algorithm; Pitprops data; Tennessee Eastman process

Funding

  1. Otto Monsteds Foundation in Denmark
  2. Swedish Research Council [340-2013-5108]

Ask authors/readers for more resources

Principal component analysis (PCA) is a widely accepted procedure for summarizing data through dimensional reduction. In PCA, the selection of the appropriate number of components and the interpretation of those components have been the key challenging features. Sparse principal component analysis (SPCA) is a relatively recent technique proposed for producing principal components with sparse loadings via the variance-sparsity trade-off. Although several techniques for deriving sparse loadings have been offered, no detailed guidelines for choosing the penalty parameters to obtain a desired level of sparsity are provided. In this paper, we propose the use of a genetic algorithm (GA) to select the number of non-zero loadings (NNZL) in each principal component while using SPCA. The proposed approach considerably improves the interpretability of principal components and addresses the difficulty in the selection of NNZL in SPCA. Furthermore, we compare the performance of PCA and SPCA in uncovering the underlying latent structure of the data. The key features of the methodology are assessed through a synthetic example, pitprops data and a comparative study of the benchmark Tennessee Eastman process.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available