4.7 Review

Performance evaluation of transcriptomics data normalization for survival risk prediction

Journal

BRIEFINGS IN BIOINFORMATICS
Volume 22, Issue 6, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbab257

Keywords

penalized Cox regression; survival prediction; data normalization; handling effects; microRNA microarray; transcriptomics data

Funding

  1. National Institutes of Health [CA214845, CA008748]

Ask authors/readers for more resources

One key feature of transcriptomics data is handling effects, which have a significant impact on survival prediction. Among various normalization methods, quantile normalization tends to underperform compared to median normalization and variance stabilizing normalization in survival prediction. It is important to evaluate normalization methods in the context of downstream analysis and applying median normalization may improve the development of survival predictors.
One pivotal feature of transcriptomics data is the unwanted variations caused by disparate experimental handling, known as handling effects. Various data normalization methods were developed to alleviate the adverse impact of handling effects in the setting of differential expression analysis. However, little research has been done to evaluate their performance in the setting of survival outcome prediction, an important analysis goal for transcriptomics data in biomedical research. Leveraging a unique pair of datasets for the same set of tumor samples-one with handling effects and the other without, we developed a benchmarking tool for conducting such an evaluation in microRNA microarrays. We applied this tool to evaluate the performance of three popular normalization methods-quantile normalization, median normalization and variance stabilizing normalization-in survival prediction using various approaches for model building and designs for sample assignment. We showed that handling effects can have a strong impact on survival prediction and that quantile normalization, a most popular method in current practice, tends to underperform median normalization and variance stabilizing normalization. We demonstrated with a small example the reason for quantile normalization's poor performance in this setting. Our finding highlights the importance of putting normalization evaluation in the context of the downstream analysis setting and the potential of improving the development of survival predictors by applying median normalization. We make available our benchmarking tool for performing such evaluation on additional normalization methods in connection with prediction modeling approaches.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available