Journal
COMPUTATIONAL STATISTICS
Volume 31, Issue 3, Pages 943-972Publisher
SPRINGER HEIDELBERG
DOI: 10.1007/s00180-016-0658-2
Keywords
Random Subspace Method; High-dimensional regression; Variable importance measure; Generalized Information Criterion; MPI; R
Categories
Funding
- European Union [POKL.04.01.01-00-051/10-00]
Ask authors/readers for more resources
Model selection and variable importance assessment in high-dimensional regression are among the most important tasks in contemporary applied statistics. In our procedure, implemented in the package regRSM, the Random Subspace Method (RSM) is used to construct a variable importance measure. The variables are ordered with respect to the measures computed in the first step using the RSM and then, from the hierarchical list of models given by the ordering, the final subset of variables is chosen using information criteria or validation set. Modifications of the original method such as the weighted Random Subspace Method and the version with initial screening of redundant variables are discussed. We developed parallel implementations which enable to reduce the computation time significantly. In this paper, we give a brief overview of the methodology, demonstrate the package's functionality and present a comparative study of the proposed algorithm and the competitive methods like lasso or CAR scores. In the performance tests the computational times for parallel implementations are compared.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available