4.7 Article

Module-based prediction approach for robust inter-study predictions in microarray data

Journal

BIOINFORMATICS
Volume 26, Issue 20, Pages 2586-2593

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btq472

Keywords

-

Funding

  1. National Institutes of Health [KL2 RR024154-04]
  2. University of Pittsburgh
  3. Cooperative Studies Program
  4. Precision Therapeutics Inc., Pittsburgh

Ask authors/readers for more resources

Motivation: Traditional genomic prediction models based on individual genes suffer from low reproducibility across microarray studies due to the lack of robustness to expression measurement noise and gene missingness when they are matched across platforms. It is common that some of the genes in the prediction model established in a training study cannot be matched to another test study because a different platform is applied. The failure of interstudy predictions has severely hindered the clinical applications of microarray. To overcome the drawbacks of traditional gene- based prediction ( GBP) models, we propose a module- based prediction ( MBP) strategy via unsupervised gene clustering. Results: K-means clustering is used to group genes sharing similar expression profiles into gene modules, and small modules are merged into their nearest neighbors. Conventional univariate or multivariate feature selection procedure is applied and a representative gene from each selected module is identified to construct the final prediction model. As a result, the prediction model is portable to any test study as long as partial genes in each module exist in the test study. We demonstrate that K-means cluster sizes generally follow a multinomial distribution and the failure probability of inter-study prediction due to missing genes is diminished by merging small clusters into their nearest neighbors. By simulation and applications of real datasets in inter-study predictions, we show that the proposed MBP provides slightly improved accuracy while is considerably more robust than traditional GBP.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available