Article

Analysis of complexity indices for classification problems: Cancer gene expression data

NEUROCOMPUTING (2012)

Journal

NEUROCOMPUTING

Volume 75, Issue 1, Pages 33-42

Publisher

ELSEVIER

DOI: 10.1016/j.neucom.2011.03.054

Keywords

Classification; Gene expression data; Complexity indices; Linear separability

Categories

Computer Science, Artificial Intelligence

Funding

Brazilian research agency CNPq
Brazilian research agency CAPES
Brazilian research agency FACEPE
Brazilian research agency UFABC

Abstract

Cancer diagnosis at the molecular level has been made possible through the analysis of gene expression data. More specifically, one usually uses machine learning (ML) techniques to build automatic diagnosis models (classifiers) from cancer gene expression data. Such data often present characteristics that can have a negative impact on the generalization ability of the resulting classifiers, among them data sparsity and an unbalanced class distribution. We investigate the results of a set of indices able to extract intrinsic complexity information from the data. Such measures can be used to analyze, among other things, which particular characteristics of cancer gene expression data most affect the prediction ability of support vector machine classifiers. In this context, we also show that, by applying a proper feature selection procedure to the data, one can reduce the influence of those characteristics on the error rates of the induced classifiers. © 2011 Elsevier B.V. All rights reserved.
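The abstract does not list the specific indices studied, so the following is only an illustrative sketch of the kind of quantities it mentions: data sparsity, class imbalance, and a separability measure. The function name `complexity_summary`, the choice of samples-per-feature as a sparsity proxy, and the use of the maximum Fisher discriminant ratio are assumptions made for illustration, not the authors' definitions.

```python
import numpy as np


def complexity_summary(X, y):
    """Illustrative (assumed) complexity indices for a two-class dataset.

    X: array of shape (n_samples, n_features), e.g. gene expression levels.
    y: array of shape (n_samples,) holding exactly two distinct labels.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    n_samples, n_features = X.shape

    classes, counts = np.unique(y, return_counts=True)
    if classes.size != 2:
        raise ValueError("this sketch handles two-class problems only")

    a = X[y == classes[0]]
    b = X[y == classes[1]]

    # Per-feature Fisher discriminant ratio:
    # (difference of class means)^2 / (sum of class variances).
    num = (a.mean(axis=0) - b.mean(axis=0)) ** 2
    den = a.var(axis=0) + b.var(axis=0)
    # Assumption: features with zero variance in both classes score 0.
    ratio = np.divide(num, den, out=np.zeros_like(num), where=den > 0)

    return {
        # Few samples per feature indicates sparse data.
        "samples_per_feature": n_samples / n_features,
        # Majority class size divided by minority class size.
        "imbalance_ratio": counts.max() / counts.min(),
        # Larger values suggest at least one feature separates the classes well.
        "max_fisher_ratio": float(ratio.max()),
    }


if __name__ == "__main__":
    # Tiny hypothetical dataset: 4 samples, 2 features, balanced classes.
    X_demo = np.array([[0.0, 5.0], [2.0, 5.0], [10.0, 5.0], [12.0, 5.0]])
    y_demo = np.array([0, 0, 1, 1])
    print(complexity_summary(X_demo, y_demo))
```

Computing such a summary before and after a feature selection step is one way to see whether selection changes the measured complexity, which is the kind of comparison the abstract describes.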
