4.4 Article

Clustering by principal component analysis with Gaussian kernel in high-dimension, low-sample-size settings

Journal

JOURNAL OF MULTIVARIATE ANALYSIS
Volume 185, Issue -, Pages -

Publisher

ELSEVIER INC
DOI: 10.1016/j.jmva.2021.104779

Keywords

HDLSS; Non-linear PCA; PC score; Radial basis function kernel; Spherical data

Ask authors/readers for more resources

This paper examines clustering based on kernel principal component analysis (KPCA) for high-dimension, low-sample-size (HDLSS) data. The study provides theoretical reasons for the effectiveness of the Gaussian kernel in clustering high-dimensional data, and explores the choice of scale parameter for optimal KPCA performance with the Gaussian kernel. Finally, the clustering performance is tested using microarray data sets.
In this paper, we consider clustering based on the kernel principal component analysis (KPCA) for high-dimension, low-sample-size (HDLSS) data. We give theoretical reasons why the Gaussian kernel is effective for clustering high-dimensional data. In addition, we discuss a choice of the scale parameter yielding a high performance of the KPCA with the Gaussian kernel. Finally, we test the performance of the clustering by using microarray data sets. (C) 2021 The Author(s). Published by Elsevier Inc.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available