期刊
INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS
卷 3, 期 3, 页码 333-345出版社
INDERSCIENCE ENTERPRISES LTD
DOI: 10.1504/IJDMB.2009.026705
关键词
feature extraction; microarray expression data; semantic similarity; bioinformatics
资金
- NSF [DBI-0234895]
- NIH [I P20 GM067650-01A1]
- NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES [P20GM067650] Funding Source: NIH RePORTER
Previous studies have proven that it is feasible to build sample classifiers using gene expression profiles. To build an effective sample classifier, dimension reduction process is necessary since classic pattern recognition algorithms do not work well in high dimensional space. In this paper, we present a novel feature extraction algorithm by integrating microarray expression data with Gene Ontology (GO). Applying semantic similarity measures, we identify the groups of genes, called virtual genes, which potentially interact with each other for a biological function. The correlation in expressions of virtual genes is used to classify samples. For colon cancer data, this approach significantly improved the classification accuracy by more than 10%.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据