
VARIABLE SELECTION FOR GENERAL INDEX MODELS VIA SLICED INVERSE REGRESSION

ANNALS OF STATISTICS (2014)

Journal

ANNALS OF STATISTICS

Volume 42, Issue 5, Pages 1751-1786

Publisher

INST MATHEMATICAL STATISTICS

DOI: 10.1214/14-AOS1233

Keywords

Interactions; inverse models; sliced inverse regression; sure independence screening; variable selection

Funding

NSF [DMS-10-07762, DMS-11-20368]
Shenzhen Special Fund for Strategic Emerging Industry [ZD201111080127A]
Direct For Mathematical & Physical Scien
Division Of Mathematical Sciences [1120368] Funding Source: National Science Foundation
Division Of Mathematical Sciences
Direct For Mathematical & Physical Scien [1007762] Funding Source: National Science Foundation


Abstract

Variable selection, also known as feature selection in machine learning, plays an important role in modeling high-dimensional data and is key to data-driven scientific discoveries. We consider here the problem of detecting influential variables under the general index model, in which the response depends on the predictors through an unknown function of one or more linear combinations of them. Instead of building a predictive model of the response given combinations of predictors, we model the conditional distribution of predictors given the response. This inverse modeling perspective motivates us to propose a stepwise procedure based on likelihood-ratio tests, which is effective and computationally efficient in identifying important variables without specifying a parametric relationship between predictors and the response. For example, the proposed procedure is able to detect variables with pairwise, three-way or even higher-order interactions among p predictors with a computational time of O(p) instead of O(p^k) (with k being the highest order of interactions). Its excellent empirical performance in comparison with existing methods is demonstrated through simulation studies as well as real data examples. Consistency of the variable selection procedure when both the number of predictors and the sample size go to infinity is established.
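The inverse-modeling idea in the abstract can be illustrated with a small sketch. This is not the authors' procedure; it is a simplified stand-in under stated assumptions: the response is cut into equal-size slices, each candidate predictor is modeled as Gaussian given the already selected predictors, and a likelihood-ratio statistic tests whether its mean shifts across slices. The function names (slice_labels, lr_statistic, forward_select), the number of slices and the threshold of 30 are all hypothetical choices made for illustration. Because it only tests mean shifts, this sketch would miss variables that act purely through interactions, which the paper's full method is designed to detect.

```python
import numpy as np

def slice_labels(y, n_slices):
    """Assign each observation to one of n_slices equal-size slices by the rank of y."""
    ranks = np.argsort(np.argsort(y))      # ranks 0..n-1
    return (ranks * n_slices) // len(y)    # slice index 0..n_slices-1

def rss(design, target):
    """Residual sum of squares from a least-squares fit of target on design."""
    coef = np.linalg.lstsq(design, target, rcond=None)[0]
    return float(np.sum((target - design @ coef) ** 2))

def lr_statistic(xj, design_null, slice_dummies):
    """Gaussian likelihood-ratio statistic n*log(RSS0/RSS1).

    Null: xj depends on the selected predictors only.
    Alternative: the mean of xj additionally shifts with the slice of y.
    """
    design_alt = np.hstack([design_null, slice_dummies])
    return len(xj) * np.log(rss(design_null, xj) / rss(design_alt, xj))

def forward_select(X, y, n_slices=5, threshold=30.0):
    """Greedy forward selection; threshold is an illustrative assumption."""
    n, p = X.shape
    # Slice indicators, first slice dropped because the design has an intercept.
    dummies = np.eye(n_slices)[slice_labels(y, n_slices)][:, 1:]
    selected = []
    while len(selected) < p:
        chosen = X[:, np.array(selected, dtype=int)]
        design_null = np.column_stack([np.ones(n), chosen])
        candidates = [j for j in range(p) if j not in selected]
        stats = [lr_statistic(X[:, j], design_null, dummies) for j in candidates]
        best = int(np.argmax(stats))
        if stats[best] < threshold:
            break                           # no remaining variable passes the test
        selected.append(candidates[best])
    return selected
```

Each forward step scans the remaining candidates once, so the number of tests per step grows linearly in p. Calling forward_select(X, y) on an n-by-p predictor matrix and a length-n response returns the indices of the selected columns in the order they were added.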
