4.5 Article Proceedings Paper

One stop shopping: feature selection, classification and prediction in a single step

Journal

JOURNAL OF CHEMOMETRICS
Volume 25, Issue 3, Pages 116-129

Publisher

WILEY
DOI: 10.1002/cem.1358

Keywords

boosting; classification; feature selection; genetic algorithms; machine learning; pattern recognition; principal component analysis; transverse learning

Ask authors/readers for more resources

We report on the application of a genetic algorithm (GA) for pattern recognition that uses both supervised and transverse learning to mine spectroscopic and proteomic data. The pattern recognition GA selects features that optimize the separation of the classes in a plot of the two or three largest principal components of the data. For training sets with small amounts of labeled data (i.e. data points tagged with a class label) and large amounts of unlabeled data (i.e. data points that are not tagged with a class label), this approach is preferred, as our results show, information in the unlabeled data is used by the fitness function to guide feature selection. The advantages of incorporating transverse learning into the fitness function of the pattern recognition GA have been evaluated in two recently published studies by our group. In one study, Raman spectroscopy and the pattern recognition GA were used to develop a potential method to discriminate hardwoods, softwoods and tropical woods. In a second study, biopsy material of small round blue cell tumors analyzed by cDNA microarrays was identified as to type (Ewings sarcoma, Burkitt's lymphoma, neuroblastoma and rhabdomyosarcoma) through supervised learning implemented by the pattern recognition GA. Copyright (C) 2011 John Wiley & Sons, Ltd.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available