4.3 Article

Gap hunting to characterize clustered probe signals in Illumina methylation array data

Journal

EPIGENETICS & CHROMATIN
Volume 9

Publisher

BMC
DOI: 10.1186/s13072-016-0107-z

Keywords

Illumina HumanMethylation450 BeadChip; 450k Array; Gap hunting; SNP; Polymorphic CpG; Epigenome-wide association studies

Funding

  1. Autism Speaks [7659]
  2. NIEHS [R01ES01900, R01ES017646]
  3. Centers for Disease Control and Prevention [U10DD000180, U10DD000181, U10DD000182, U10DD000183, U10DD000184, U10DD000498]
  4. Burroughs-Wellcome Trust training grant: Maryland, Genetics, Epidemiology and Medicine (MD-GEM)

Background: The Illumina 450k array has been widely used in epigenetic association studies. Current quality-control (QC) pipelines typically remove certain sets of probes, such as those containing a SNP or those with multiple mapping locations. An additional set of potentially problematic probes consists of those whose DNA methylation distributions form two or more distinct clusters separated by gaps. Data-driven identification of such probes may offer additional insights for downstream analyses.

Results: We developed a procedure, termed gap hunting, to identify probes showing clustered distributions. Among 590 peripheral blood samples from the Study to Explore Early Development, we identified 11,007 gap probes. The vast majority (9199) are likely attributable to one or more underlying SNPs or other variants in the probe, although some SNP-affected probes do not produce gap signals. Specific factors predict which SNPs lead to gap signals, including the type of nucleotide change, probe type, DNA strand, and overall methylation state. These expected effects are demonstrated in paired genotype and 450k data on the same samples. Gap probes can also serve as a surrogate for the local genetic sequence on a haplotype scale and can be used to adjust for population stratification.

Conclusions: The characteristics of gap probes reflect potentially informative biology. QC pipelines may benefit from an efficient data-driven approach that flags gap probes rather than filtering them out, followed by careful interpretation of downstream association analyses. Our results should translate directly to the recently released Illumina EPIC array, given its similar chemistry and content design.
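The idea of flagging probes whose methylation values fall into clusters separated by gaps can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the gap threshold of 0.05, and the outlier cutoff of 0.01 are all assumptions chosen for illustration, since the abstract does not state the parameters of the published procedure. The sketch sorts one probe's beta values, splits them wherever two neighbouring values differ by more than the threshold, and flags the probe only if the samples outside the largest cluster are more than a small fraction of the total.

```python
import numpy as np


def find_gap_groups(betas, threshold=0.05):
    """Split one probe's beta values into clusters separated by gaps.

    `threshold` is a hypothetical minimum gap size, not a value from the paper.
    Returns a list of arrays, one per cluster, in ascending order.
    """
    values = np.sort(np.asarray(betas, dtype=float))
    # Positions where consecutive sorted values differ by more than the threshold
    breaks = np.where(np.diff(values) > threshold)[0] + 1
    return np.split(values, breaks)


def is_gap_probe(betas, threshold=0.05, out_cutoff=0.01):
    """Flag a probe as a gap probe under the assumed rules of this sketch.

    `out_cutoff` is a hypothetical fraction: if the samples outside the
    largest cluster make up no more than this share of all samples, the
    extra clusters are treated as outliers and the probe is not flagged.
    """
    groups = find_gap_groups(betas, threshold)
    if len(groups) < 2:
        return False
    n_total = sum(len(g) for g in groups)
    n_largest = max(len(g) for g in groups)
    return (n_total - n_largest) / n_total > out_cutoff


# Usage: three clusters, as might arise from three genotype groups at a SNP
clustered = [0.10, 0.12, 0.11, 0.50, 0.52, 0.90, 0.91]
print(len(find_gap_groups(clustered)), is_gap_probe(clustered))
```

Consistent with the abstract's recommendation, a flag like this would mark a probe for careful interpretation rather than remove it from the analysis.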
