☆ 4.3 Article

Performance of Feature Selection Methods

CURRENT GENOMICS (2009)

期刊

CURRENT GENOMICS

卷 10, 期 6, 页码 365-374

出版社

BENTHAM SCIENCE PUBL LTD

DOI: 10.2174/138920209789177629

关键词

类别

Biochemistry & Molecular Biology Genetics & Heredity

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

High-throughput biological technologies offer the promise of finding feature sets to serve as biomarkers for medical applications; however, the sheer number of potential features (genes, proteins, etc.) means that there needs to be massive feature selection, far greater than that envisioned in the classical literature. This paper considers performance analysis for feature-selection algorithms from two fundamental perspectives: How does the classification accuracy achieved with a selected feature set compare to the accuracy when the best feature set is used and what is the optimal number of features that should be used? The criteria manifest themselves in several issues that need to be considered when examining the efficacy of a feature-selection algorithm: (1) the correlation between the classifier errors for the selected feature set and the theoretically best feature set; (2) the regressions of the aforementioned errors upon one another; (3) the peaking phenomenon, that is, the effect of sample size on feature selection; and (4) the analysis of feature selection in the framework of high-dimensional models corresponding to high-throughput data.

Performance of Feature Selection Methods

期刊

CURRENT GENOMICS

出版社

BENTHAM SCIENCE PUBL LTD

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Performance of Feature Selection Methods

期刊

CURRENT GENOMICS

出版社

BENTHAM SCIENCE PUBL LTD

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文