4.5 Article Proceedings Paper

One stop shopping: feature selection, classification and prediction in a single step

期刊

JOURNAL OF CHEMOMETRICS
卷 25, 期 3, 页码 116-129

出版社

WILEY
DOI: 10.1002/cem.1358

关键词

boosting; classification; feature selection; genetic algorithms; machine learning; pattern recognition; principal component analysis; transverse learning

向作者/读者索取更多资源

We report on the application of a genetic algorithm (GA) for pattern recognition that uses both supervised and transverse learning to mine spectroscopic and proteomic data. The pattern recognition GA selects features that optimize the separation of the classes in a plot of the two or three largest principal components of the data. For training sets with small amounts of labeled data (i.e. data points tagged with a class label) and large amounts of unlabeled data (i.e. data points that are not tagged with a class label), this approach is preferred, as our results show, information in the unlabeled data is used by the fitness function to guide feature selection. The advantages of incorporating transverse learning into the fitness function of the pattern recognition GA have been evaluated in two recently published studies by our group. In one study, Raman spectroscopy and the pattern recognition GA were used to develop a potential method to discriminate hardwoods, softwoods and tropical woods. In a second study, biopsy material of small round blue cell tumors analyzed by cDNA microarrays was identified as to type (Ewings sarcoma, Burkitt's lymphoma, neuroblastoma and rhabdomyosarcoma) through supervised learning implemented by the pattern recognition GA. Copyright (C) 2011 John Wiley & Sons, Ltd.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据