4.5 Article

ST-PLS: a multi-directional nearest shrunken centroid type classifier via PLS

Journal

JOURNAL OF CHEMOMETRICS
Volume 22, Issue 1-2, Pages 54-62

Publisher

WILEY
DOI: 10.1002/cem.1101

Keywords

classification; gene expression; soft-thresholding; variable selection

The nearest shrunken centroid (NSC) Classifier is successfully applied for class prediction in a wide range of studies based on microarray data. The contribution from seemingly irrelevant variables to the classifier is minimized by the so-called soft-thresholding property of the approach. In this paper, we first show that for the two-class prediction problem, the NSC Classifier is similar to a one-component discriminant partial least squares (PLS) model with soft-shrinkage of the loading weights. Then we introduce the soft-threshold-PLS (ST-PLS) as a general discriminant-PLS model with soft-thresholding of the loading weights of multiple latent components. This method is especially suited for classification and variable selection when the number of variables is large compared to the number of samples, which is typical for gene expression data. A characteristic feature of ST-PLS is the ability to identify important variables in multiple directions in the variable space. Both the ST-PLS and the NSC classifiers are applied to four real data sets. The results indicate that ST-PLS performs better than the shrunken centroid approach if there are several directions in the variable space which are important for classification, and there are strong dependencies between subsets of variables. Copyright (c) 2007 John Wiley & Sons, Ltd.
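
The soft-thresholding of loading weights mentioned in the abstract can be illustrated with a small sketch. The abstract does not give the paper's exact formulas, so everything below is an assumption made for illustration: the function names `soft_threshold` and `thresholded_loading_weights`, the threshold parameter `eta`, the scaling of the weights by their largest absolute value before thresholding, and the use of the first PLS1 loading-weight vector (proportional to X'y on centered data) for a two-class response coded 0/1. It is not the authors' implementation.

```python
import numpy as np

def soft_threshold(w, delta):
    """Shrink each entry of w toward zero by delta; entries with |w| <= delta become 0."""
    return np.sign(w) * np.maximum(np.abs(w) - delta, 0.0)

def thresholded_loading_weights(X, y, eta):
    """First-component PLS loading weights with soft-thresholding (illustrative only).

    Assumptions (not taken from the abstract): weights are scaled so the
    largest absolute weight is 1, thresholded with eta in [0, 1), and then
    rescaled to unit length.
    """
    Xc = X - X.mean(axis=0)            # center each variable
    yc = y - y.mean()                  # center the 0/1 class indicator
    w = Xc.T @ yc                      # PLS1 loading weights, up to scaling
    w = w / np.max(np.abs(w))          # assumed scaling step
    w = soft_threshold(w, eta)         # small weights are set exactly to zero
    norm = np.linalg.norm(w)
    return w / norm if norm > 0 else w

# Toy data: 20 samples, 50 variables, only variable 0 separates the classes.
rng = np.random.default_rng(0)
X = rng.normal(size=(20, 50))
y = np.repeat([0.0, 1.0], 10)
X[:, 0] += 3.0 * y
w = thresholded_loading_weights(X, y, eta=0.5)
print("non-zero loading weights:", np.count_nonzero(w))
```

Variables whose thresholded weight is exactly zero drop out of the component, which is how the thresholding doubles as variable selection. A multi-component model of the kind the abstract describes would repeat such a step on deflated data for each latent component, but that part is not sketched here.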
