4.7 Article

Linear regression and two-class classification with gene expression data

Journal

BIOINFORMATICS
Volume 19, Issue 16, Pages 2072-2078

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btg283

Keywords

-

Funding

  1. NHLBI NIH HHS [R01-HL65462] Funding Source: Medline
  2. NATIONAL HEART, LUNG, AND BLOOD INSTITUTE [R01HL065462] Funding Source: NIH RePORTER

Ask authors/readers for more resources

Motivation: Using gene expression data to classify (or predict) tumor types has received much research attention recently. Due to some special features of gene expression data, several new methods have been proposed, including the weighted voting scheme of Golub et al., the compound covariate method of Hedenfalk et al. (originally proposed by Tukey), and the shrunken centroids method of Tibshirani et al. These methods look different and are more or less ad hoc. Results: We point out a close connection of the three methods with a linear regression model. Casting the classification problem in the general framework of linear regression naturally leads to new alternatives, such as partial least squares (PLS) methods and penalized PLS (PPLS) methods. Using two real data sets, we show the competitive performance of our new methods when compared with the other three methods.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available