Article, Proceedings Paper

Automatic model selection for high-dimensional survival analysis

Journal

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1080/00949655.2014.929131

Keywords

model selection; feature selection; algorithm configuration; survival analysis; machine learning; high-dimensional data; racing; parameter tuning

Funding

  1. Deutsche Forschungsgemeinschaft (DFG) within the Collaborative Research Center [SFB 876, SFB 823]


Many models for the analysis of high-dimensional survival data have been developed in recent years. While some models and implementations come with built-in automatic parameter tuning, others require the user to adjust default settings manually, which often feels like a guessing game. Exhaustively trying all combinations of models and parameters quickly becomes tedious, or infeasible in computationally intensive settings, even when parallelization is employed. We therefore propose using modern algorithm configuration techniques, e.g. iterated F-racing, to search the model hypothesis space efficiently and to configure algorithm classes and their respective hyperparameters simultaneously. In our application, we study four lung cancer microarray data sets. For these, we configure a predictor based on five survival analysis algorithms in combination with eight feature selection filters. We parallelize the optimization and all comparison experiments with the BatchJobs and BatchExperiments R packages.
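To make the idea of racing over a joint space of filters, learners, and hyperparameters more concrete, here is a minimal Python sketch of a single, simplified race. It is an illustration under stated assumptions, not the authors' implementation (which is in R). The search space, the names `learner_a`, `learner_b`, `filter_x`, `filter_y`, and the cost function `toy_cost` are hypothetical placeholders. The chi-square critical value uses the Wilson-Hilferty approximation. After a significant Friedman test the sketch simply drops the worst-ranked candidate, whereas a full F-race would run pairwise post-hoc comparisons against the best candidate, and an iterated F-race would additionally sample new candidates around the survivors.

```python
import random
import statistics

# Hypothetical search space: names and ranges are placeholders, not the
# filters, learners, or hyperparameters used in the paper.
SEARCH_SPACE = {
    "learner_a": {"penalty": (0.01, 1.0)},
    "learner_b": {"n_trees": (50, 500)},
}
FILTERS = ["filter_x", "filter_y"]


def sample_config(rng):
    """Draw a joint configuration: a filter, a learner, and only that learner's parameters."""
    learner = rng.choice(sorted(SEARCH_SPACE))
    params = {name: rng.uniform(lo, hi) for name, (lo, hi) in SEARCH_SPACE[learner].items()}
    return {"filter": rng.choice(FILTERS), "learner": learner, "params": params}


def toy_cost(config, instance):
    """Stand-in for a resampled prediction error; a real run would fit a survival model here."""
    # Instance difficulty is shared by all candidates on the same instance (blocking).
    difficulty = random.Random(instance).random() * 0.05
    if config["learner"] == "learner_a":
        return 0.3 + 0.1 * config["params"]["penalty"] + difficulty
    return 0.5 + 0.0001 * config["params"]["n_trees"] + difficulty


def mean_ranks(rows):
    """rows[i][j] is the cost of candidate j on instance i; rank 1 is best. Ties are not averaged."""
    k = len(rows[0])
    totals = [0.0] * k
    for row in rows:
        for rank, j in enumerate(sorted(range(k), key=lambda j: row[j]), start=1):
            totals[j] += rank
    return [total / len(rows) for total in totals]


def chi2_critical(df, alpha=0.05):
    """Approximate upper chi-square quantile (Wilson-Hilferty approximation)."""
    z = statistics.NormalDist().inv_cdf(1 - alpha)
    c = 2.0 / (9.0 * df)
    return df * (1 - c + z * c ** 0.5) ** 3


def race(candidates, evaluate, instances, min_instances=5, alpha=0.05):
    """Evaluate candidates instance by instance and eliminate poor ones early."""
    alive = list(range(len(candidates)))
    costs = {j: [] for j in alive}
    for t, instance in enumerate(instances, start=1):
        if len(alive) == 1:
            break  # a single survivor needs no further evaluation
        for j in alive:
            costs[j].append(evaluate(candidates[j], instance))
        if t < min_instances:
            continue
        k = len(alive)
        ranks = mean_ranks([[costs[j][i] for j in alive] for i in range(t)])
        # Friedman statistic computed from the mean ranks over t instances.
        q = 12 * t / (k * (k + 1)) * sum((r - (k + 1) / 2) ** 2 for r in ranks)
        if q > chi2_critical(k - 1, alpha):
            # Simplification: drop only the worst-ranked candidate.
            alive.pop(max(range(k), key=lambda idx: ranks[idx]))
    return [candidates[j] for j in alive]


rng = random.Random(0)
candidates = [sample_config(rng) for _ in range(8)]
survivors = race(candidates, toy_cost, instances=range(15))
print(survivors)
```

The point of the design is that weak configurations stop consuming evaluations as soon as the evidence against them is statistically significant, so the budget concentrates on promising combinations of learner and hyperparameters.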

Authors

