Article

Comparison of pre-processing methods for Infinium HumanMethylation450 BeadChip array

Journal

BIOINFORMATICS
Volume 33, Issue 20, Pages 3151-3157

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btx372

Funding

  1. Movember funds through Prostate Cancer Canada
  2. Ontario Institute for Cancer Research - Government of Ontario
  3. Ontario Institute for Cancer Research through Government of Ontario
  4. Princess Margaret Cancer Centre Foundation
  5. Radiation Medicine Program Academic Enrichment Fund
  6. Canadian Cancer Society Research Scientist Award
  7. Prostate Cancer Canada
  8. Movember Foundation [RS2014-01]
  9. Terry Fox Research Institute New Investigator Award
  10. CIHR New Investigator Award

Motivation: Microarrays are widely used to quantify DNA methylation because they are economical, require only small quantities of input DNA and focus on well-characterized regions of the genome. However, pre-processing of methylation microarray data is challenging because of confounding factors that include background fluorescence, dye bias and the impact of germline polymorphisms. We therefore present a data-driven framework, and the insights it yields, for those seeking the optimal pre-processing method.

Results: Here, we show that Dasen is the optimal pre-processing methodology in prostate cancer for the Infinium HumanMethylation450 BeadChip array, a platform frequently employed for tumour methylome profiling in both the TCGA and ICGC consortia. We evaluated the impact of 11 pre-processing methods on batch effects, replicate variability, sensitivity and sample-to-sample correlations across 809 independent prostate cancer samples, including 150 reported for the first time in this study. Overall, Dasen is the most effective at removing artefacts and detecting biological differences associated with tumour aggressiveness. Relative to the raw dataset, it reduces replicate variance by 67% and 76% for β- and M-values, respectively. Our study provides a unique pre-processing benchmark for the community with an emphasis on biological implications.
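
For readers unfamiliar with the two scales mentioned in the abstract, the following is a minimal Python sketch, not the authors' pipeline. It uses the standard relation M = log2(β / (1 − β)) between β-values and M-values and shows one simple way a percentage reduction in replicate variance could be computed. The function names, the clamping constant `eps`, the use of per-probe sample variance averaged across probes, and all numbers in the usage lines are assumptions made for illustration; none are taken from the study.

```python
import math
from statistics import variance


def beta_to_m(beta, eps=1e-6):
    """Convert a beta-value in [0, 1] to an M-value: log2(beta / (1 - beta)).

    eps is an assumed clamp that keeps the logit finite at exactly 0 or 1.
    """
    b = min(max(beta, eps), 1.0 - eps)
    return math.log2(b / (1.0 - b))


def m_to_beta(m):
    """Inverse transform: beta = 2^M / (2^M + 1)."""
    return 2.0 ** m / (2.0 ** m + 1.0)


def mean_replicate_variance(probes):
    """Average, over probes, of the sample variance across technical replicates.

    probes: list of lists, one inner list of replicate measurements per probe.
    This summary statistic is an illustrative choice, not the paper's definition.
    """
    return sum(variance(p) for p in probes) / len(probes)


def percent_reduction(raw_var, processed_var):
    """Percentage by which processed_var is smaller than raw_var."""
    return 100.0 * (1.0 - processed_var / raw_var)


# Synthetic replicate beta-values for two hypothetical probes (not study data).
raw = [[0.20, 0.30], [0.70, 0.80]]
processed = [[0.24, 0.26], [0.74, 0.76]]

reduction_beta = percent_reduction(
    mean_replicate_variance(raw), mean_replicate_variance(processed)
)

# The same comparison on the M-value scale.
raw_m = [[beta_to_m(b) for b in p] for p in raw]
processed_m = [[beta_to_m(b) for b in p] for p in processed]
reduction_m = percent_reduction(
    mean_replicate_variance(raw_m), mean_replicate_variance(processed_m)
)
```

Because the logit transform stretches values near 0 and 1, the same data can give different reductions on the two scales, which is consistent with the abstract reporting separate figures for β- and M-values.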
