☆ 4.5 Review

A review of feature selection methods on synthetic data

KNOWLEDGE AND INFORMATION SYSTEMS (2013)

期刊

KNOWLEDGE AND INFORMATION SYSTEMS

卷 34, 期 3, 页码 483-519

出版社

SPRINGER LONDON LTD

DOI: 10.1007/s10115-012-0487-8

关键词

Feature selection; Filters; Embedded methods; Wrappers; Synthetic datasets

类别

Computer Science, Artificial Intelligence Computer Science, Information Systems

资金

Spanish Ministerio de Ciencia e Innovacion [TIN 2009-02402]
European Union ERDF
Xunta de Galicia under Plan I2C Grant Program

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

With the advent of high dimensionality, adequate identification of relevant features of the data has become indispensable in real-world scenarios. In this context, the importance of feature selection is beyond doubt and different methods have been developed. However, with such a vast body of algorithms available, choosing the adequate feature selection method is not an easy-to-solve question and it is necessary to check their effectiveness on different situations. Nevertheless, the assessment of relevant features is difficult in real datasets and so an interesting option is to use artificial data. In this paper, several synthetic datasets are employed for this purpose, aiming at reviewing the performance of feature selection methods in the presence of a crescent number or irrelevant features, noise in the data, redundancy and interaction between attributes, as well as a small ratio between number of samples and number of features. Seven filters, two embedded methods, and two wrappers are applied over eleven synthetic datasets, tested by four classifiers, so as to be able to choose a robust method, paving the way for its application to real datasets.

A review of feature selection methods on synthetic data

期刊

KNOWLEDGE AND INFORMATION SYSTEMS

出版社

SPRINGER LONDON LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

A review of feature selection methods on synthetic data

期刊

KNOWLEDGE AND INFORMATION SYSTEMS

出版社

SPRINGER LONDON LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文