4.6 Article

High-Dimensional Variable Selection for Survival Data

Journal

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
Volume 105, Issue 489, Pages 205-217

Publisher

TAYLOR & FRANCIS INC
DOI: 10.1198/jasa.2009.tm08622

Keywords

Forest; Maximal subtree; Minimal depth; Random survival forest; Tree; VIMP

Funding

  1. National Heart. Lung. and Blood Institute [CAN 8324207]
  2. Department of Defense [BC085325]

Ask authors/readers for more resources

The minimal depth of a maximal subtree IN a dimensionless order statistic measuring the predictiveness of a variable in a survival tree We derive the distribution of the minimal depth and use it lot high-dimensional variable selection using random survival forests In big p and small n problems (where p is the dimension and n Is the sample size). the distribution of the minimal depth reveals a ceiling effect in which a tree simply cannot be grown deep enough to properly identify predictive variables Motivated by this limitation. we develop a new regularized algorithm. termed RSF-Variable Hunting This algorithm exploits maximal subtrees for effective variable selection under such scenarios Several applications are presented demonstrating the methodology. including the problem of gene selection using microarray data In this work we focus only on survival settings. although out methodology also applies to other random forests applications. including regression and classification settings All examples presented here use the R-software package randomSurvivalForest

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available