Article

An Importance Weighted Feature Selection Stability Measure

Journal

JOURNAL OF MACHINE LEARNING RESEARCH
Volume 22, Issue -, Pages 1-57

Publisher

MICROTOME PUBL

Keywords

feature selection; selection stability; bi-objective optimization; bioinformatics; feature importance


Feature selection methods, especially on high-dimensional data, can be unstable. This work proposes a stability measure that incorporates the importance of the selected features in predictive models. On micro-array and mass-spectrometric data, it corrects overly optimistic stability estimates, which leads to improved decision-making.

Current feature selection methods, especially when applied to high-dimensional data, tend to suffer from instability, since marginal modifications in the data may result in largely distinct selected feature sets. Such instability strongly limits a sound interpretation of the selected variables by domain experts. Defining an adequate stability measure is itself a research question. In this work, we propose to incorporate into the stability measure the importances of the selected features in predictive models. Such feature importances are directly proportional to feature weights in a linear model. We also consider the generalization to a non-linear setting. We illustrate, theoretically and experimentally, that current stability measures are subject to undesirable behaviors, for example, when they are jointly optimized with predictive accuracy. Results on micro-array and mass-spectrometric data show that our novel stability measure corrects for overly optimistic stability estimates in such a bi-objective context, which leads to improved decision-making. It is also shown to be less prone to under- or over-estimation of the stability value in feature spaces with groups of highly correlated variables.
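
To make the contrast in the abstract concrete, the sketch below compares a purely set-based stability score with an importance-aware one. It is an illustration only and not the measure defined in the paper: the set-based score is the mean pairwise Jaccard index, and the importance-aware score uses a weighted Jaccard (Ruzicka) similarity as a stand-in. The function names, the dictionary representation of a run, and the toy importance values are all assumptions made for this example.

```python
from itertools import combinations


def jaccard_stability(selections):
    """Mean pairwise Jaccard index over a list of selected-feature sets."""
    pairs = list(combinations(selections, 2))
    total = 0.0
    for a, b in pairs:
        union = a | b
        # Two empty selections are treated as identical.
        total += len(a & b) / len(union) if union else 1.0
    return total / len(pairs)


def weighted_stability(importances):
    """Mean pairwise weighted Jaccard similarity over importance maps.

    Each run is a dict mapping a selected feature to a nonnegative
    importance (for a linear model, assumed here to be an absolute weight).
    Unselected features are absent and count as zero.
    NOTE: a stand-in for illustration, not the paper's measure.
    """
    def similarity(u, v):
        keys = set(u) | set(v)
        num = sum(min(u.get(k, 0.0), v.get(k, 0.0)) for k in keys)
        den = sum(max(u.get(k, 0.0), v.get(k, 0.0)) for k in keys)
        return num / den if den else 1.0

    pairs = list(combinations(importances, 2))
    return sum(similarity(u, v) for u, v in pairs) / len(pairs)


# Hypothetical toy runs: the same two features are selected each time,
# but their importances are swapped between runs.
same_sets = [{"a": 0.9, "b": 0.1}, {"a": 0.1, "b": 0.9}]
print(jaccard_stability([set(r) for r in same_sets]))  # 1.0
print(weighted_stability(same_sets))                   # about 0.111

# Hypothetical toy runs: the dominant feature is shared,
# only a low-importance feature differs.
minor_diff = [{"a": 0.9, "b": 0.1}, {"a": 0.9, "c": 0.1}]
print(jaccard_stability([set(r) for r in minor_diff]))  # about 0.333
print(weighted_stability(minor_diff))                   # about 0.818
```

In the first toy case the set-based score reports perfect stability even though the two models rely on different features, which is the kind of overly optimistic estimate the abstract describes. In the second, it penalizes a disagreement that involves only a low-importance feature.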


