Article

Efficient feature selection filters for high-dimensional data

PATTERN RECOGNITION LETTERS (2012)

Journal

PATTERN RECOGNITION LETTERS

Volume 33, Issue 13, Pages 1794-1804

Publisher

ELSEVIER

DOI: 10.1016/j.patrec.2012.05.019

Keywords

Feature selection; Filters; Dispersion measures; Similarity measures; High-dimensional data

Category

Computer Science, Artificial Intelligence

Funding

Polytechnic Institute of Lisbon [SFRH/PROTEC/67605/2010]
FCT project [PEst-OE/EEI/LA0008/2011]

Abstract

Feature selection is a central problem in machine learning and pattern recognition. On large datasets (in terms of dimension and/or number of instances), using search-based or wrapper techniques can be computationally prohibitive. Moreover, many filter methods based on relevance/redundancy assessment also take a prohibitively long time on high-dimensional datasets. In this paper, we propose efficient unsupervised and supervised feature selection/ranking filters for high-dimensional datasets. These methods use low-complexity relevance and redundancy criteria that are applicable to supervised, semi-supervised, and unsupervised learning, and they can act as pre-processors that let computationally intensive methods focus their attention on smaller subsets of promising features. The experimental results, with up to 10^5 features, show the time efficiency of our methods, with lower generalization error than state-of-the-art techniques, while being dramatically simpler and faster. © 2012 Elsevier B.V. All rights reserved.
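
The abstract names the ingredients (a low-complexity relevance criterion, a redundancy criterion, and the keywords "dispersion measures" and "similarity measures") but does not spell out the procedure. The following is only an illustrative sketch of what an unsupervised filter of that general shape might look like, not the authors' published algorithm. The function name `dispersion_similarity_filter`, the choice of mean absolute deviation as the dispersion measure, the absolute cosine as the similarity measure, and the `max_similarity` threshold are all assumptions made for this example.

```python
import numpy as np


def dispersion_similarity_filter(X, max_features, max_similarity=0.8):
    """Hypothetical unsupervised relevance/redundancy filter (illustration only).

    X is an (instances x features) array. Returns the indices of kept features.
    """
    X = np.asarray(X, dtype=float)

    # Relevance (assumed): per-feature dispersion, here the mean absolute
    # deviation from the feature mean. Costs one pass over the data.
    relevance = np.mean(np.abs(X - X.mean(axis=0)), axis=0)

    # Rank features from most to least dispersed.
    order = np.argsort(-relevance, kind="stable")

    norms = np.linalg.norm(X, axis=0)
    selected = []
    for j in order:
        if len(selected) == max_features:
            break
        if norms[j] == 0:
            continue  # an all-zero feature carries no information
        if selected:
            # Redundancy (assumed): absolute cosine similarity with the most
            # recently kept feature only, so the check stays linear in the
            # number of features instead of quadratic.
            last = selected[-1]
            sim = abs(X[:, j] @ X[:, last]) / (norms[j] * norms[last])
            if sim > max_similarity:
                continue
        selected.append(int(j))
    return selected
```

Comparing each candidate only with the last kept feature is a deliberate simplification: it keeps the cost near one pass over the ranked features, which is the kind of trade-off a pre-processing filter for a heavier wrapper method would favor, at the price of missing redundancy between features that are far apart in the ranking.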
