4.2 Article

Random Subspace Method for high-dimensional regression with the R package regRSM

Journal

COMPUTATIONAL STATISTICS
Volume 31, Issue 3, Pages 943-972

Publisher

SPRINGER HEIDELBERG
DOI: 10.1007/s00180-016-0658-2

Keywords

Random Subspace Method; High-dimensional regression; Variable importance measure; Generalized Information Criterion; MPI; R

Funding

  1. European Union [POKL.04.01.01-00-051/10-00]

Ask authors/readers for more resources

Model selection and variable importance assessment in high-dimensional regression are among the most important tasks in contemporary applied statistics. In our procedure, implemented in the package regRSM, the Random Subspace Method (RSM) is used to construct a variable importance measure. The variables are ordered with respect to the measures computed in the first step using the RSM and then, from the hierarchical list of models given by the ordering, the final subset of variables is chosen using information criteria or validation set. Modifications of the original method such as the weighted Random Subspace Method and the version with initial screening of redundant variables are discussed. We developed parallel implementations which enable to reduce the computation time significantly. In this paper, we give a brief overview of the methodology, demonstrate the package's functionality and present a comparative study of the proposed algorithm and the competitive methods like lasso or CAR scores. In the performance tests the computational times for parallel implementations are compared.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.2
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available