4.7 Article

An instance voting approach to feature selection

期刊

INFORMATION SCIENCES
卷 504, 期 -, 页码 449-469

出版社

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2019.07.018

关键词

Feature selection; Filter-based method; Set-covering problem; Instance voting; Graph modularity; Priority coverage

资金

  1. Signals and Systems for Life Science (SSLS) scheme of Ministry of Human Resource Development, Government of India [4-23/2014-TS.I]

向作者/读者索取更多资源

In this work, we address the problem of supervised feature selection (FS) for high-dimensional datasets with a small number of instances. Here, we propose a novel heuristic FS approach, Conditional Priority Coverage Maximization (CPCM) which seeks to leverage the local information provided by the small set of instances. We define the vote assigned by an instance to a feature as the local relevance of the latter. Also, we show that the proposed voting scheme is asymptotically related to the Bayes' decision rule for minimum risk classification. Next, we exploit the instance votes for feature selection by posing it as a set-covering problem - we seek to select a subset of features such that they can together cover the instances. This approach avoids the selection of redundant features, while selecting relevant ones. In addition, we formulate the stopping criterion to select a compact subset of features. Through experiments on synthetic and real datasets, we demonstrated that CPCM outperforms other graph based FS techniques and state-of-the-art FS approaches employing mutual information (MI). Further, we evaluated the stability of CPCM to minor variations in the training data and found it to be reasonably robust. (C) 2019 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据