Journal
GENOME RESEARCH
Volume 11, Issue 11, Pages 1878-1887Publisher
COLD SPRING HARBOR LAB PRESS, PUBLICATIONS DEPT
DOI: 10.1101/gr.190001
Keywords
-
Categories
Funding
- NHLBI NIH HHS [HL 5448] Funding Source: Medline
- NIGMS NIH HHS [GM 56515] Funding Source: Medline
Ask authors/readers for more resources
Gene expression studies bridge the gap between DNA information and trait information by dissecting biochemical pathways into intermediate components between genotype and phenotype. These Studies open new avenues for identifying complex disease genes and biomarkers for disease diagnosis and for assessing drug efficacy and toxicity. However, the majority of analytical methods applied to gene expression data are not efficient for biomarker identification and disease diagnosis. In this paper, we propose a general framework to incorporate feature (gene) selection into pattern recognition in the process to identify biomarkers. Using this framework, we develop three feature wrappers that search through the space Of feature subsets using the classification error as measure of goodness for a particular feature subset being wrapped around: linear discriminant analysis, logistic regression, and support vector machines. To effectively carry Out this computationally intensive search process, we employ sequential forward search and Sequential forward floating search algorithms. To evaluate the performance of feature selection for biomarker identification we have applied the proposed methods to three data sets. The preliminary results demonstrate that very high classification accuracy can be attained by identified composite classifiers with several biomarkers.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available