4.7 Review

Combining clinical and molecular data in regression prediction models: insights from a simulation study

Journal

BRIEFINGS IN BIOINFORMATICS
Volume 21, Issue 6, Pages 1904-1919

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbz136

Keywords

prediction models; data integration; regularized regression

Funding

  1. German Research Foundation (DFG) [BO3139/4-2, SA580/8-2]
  2. Uninett/Sigma2 [NN9480K]

Data integration, i.e. the use of different sources of information for data analysis, is becoming one of the most important topics in modern statistics. Especially in, but not limited to, biomedical applications, a relevant issue is the combination of low-dimensional (e.g. clinical data) and high-dimensional (e.g. molecular data such as gene expressions) data sources in a prediction model. Not only the different characteristics of the data, but also the complex correlation structure within and between the two data sources, pose challenging issues. In this paper, we investigate these issues via simulations, providing some useful insight into strategies to combine low- and high-dimensional data in a regression prediction model. In particular, we focus on the effect of the correlation structure on the results, while accounting for the influence of our specific choices in the design of the simulation study.
