4.3 Article

Using Family Data as a Verification Standard to Evaluate Copy Number Variation Calling Strategies for Genetic Association Studies

期刊

GENETIC EPIDEMIOLOGY
卷 36, 期 3, 页码 253-262

出版社

WILEY-BLACKWELL
DOI: 10.1002/gepi.21618

关键词

evaluation; CNV-calling strategies; family-based GWAS

资金

  1. Johns Hopkins University (JHU) Center for Inherited Disease Research (CIDR) [HHSN268200782096C]
  2. NIDCR [R01-DE 014899, R01-DE09551, R01-DE12101]
  3. Danish National Research Foundation
  4. Danish Pharmacist's Fund
  5. Egmont Foundation
  6. March of Dimes Birth Defects Foundation
  7. Augustinus Foundation
  8. Health Fund of the Danish Health Insurance Societies
  9. [T32MH015169]
  10. [U01DE018903]
  11. [U01HG004423]
  12. [U01HG004446]

向作者/读者索取更多资源

A major concern for all copy number variation (CNV) detection algorithms is their reliability and repeatability. However, it is difficult to evaluate the reliability of CNV-calling strategies due to the lack of gold-standard data that would tell us which CNVs are real. We propose that if CNVs are called in duplicate samples, or inherited from parent to child, then these can be considered validated CNVs. We used two large family-based genome-wide association study (GWAS) datasets from the GENEVA consortium to look at concordance rates of CNV calls between duplicate samples, parent-child pairs, and unrelated pairs. Our goal was to make recommendations for ways to filter and use CNV calls in GWAS datasets that do not include family data. We used PennCNV as our primary CNV-calling algorithm, and tested CNV calls using different datasets and marker sets, and with various filters on CNVs and samples. Using the Illumina core HumanHap550 single nucleotide polymorphism (SNP) set, we saw duplicate concordance rates of approximately 55% and parent-child transmission rates of approximately 28% in our datasets. GC model adjustment and sample quality filtering had little effect on these reliability measures. Stratification on CNV size and DNA sample type did have some effect. Overall, our results show that it is probably not possible to find a CNV-calling strategy (including filtering and algorithm) that will give us a set of reliable CNV calls using current chip technologies. But if we understand the error process, we can still use CNV calls appropriately in genetic association studies. Genet. Epidemiol. 36:253-262, 2012. (C) 2012 Wiley Periodicals, Inc.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据