4.7 Article

A Bootstrap Based Neyman-Pearson Test for Identifying Variable Importance

Journal

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNNLS.2014.2320415

Keywords

Feature selection (FS); Neyman-Pearson

Funding

  1. National Science Foundation CAREER [0845827]
  2. National Science Foundation [1120622, ECCS-0926159, ECCS-1310496]
  3. Department of Energy [SC004335]
  4. Direct For Biological Sciences
  5. Div Of Biological Infrastructure [0845827] Funding Source: National Science Foundation
  6. Direct For Mathematical & Physical Scien
  7. Division Of Mathematical Sciences [1120622] Funding Source: National Science Foundation
  8. Division Of Mathematical Sciences
  9. Direct For Mathematical & Physical Scien [1418744] Funding Source: National Science Foundation

Ask authors/readers for more resources

Selection of most informative features that leads to a small loss on future data are arguably one of the most important steps in classification, data analysis and model selection. Several feature selection (FS) algorithms are available; however, due to noise present in any data set, FS algorithms are typically accompanied by an appropriate cross-validation scheme. In this brief, we propose a statistical hypothesis test derived from the Neyman-Pearson lemma for determining if a feature is statistically relevant. The proposed approach can be applied as a wrapper to any FS algorithm, regardless of the FS criteria used by that algorithm, to determine whether a feature belongs in the relevant set. Perhaps more importantly, this procedure efficiently determines the number of relevant features given an initial starting point. We provide freely available software implementations of the proposed methodology.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available