期刊
GENETIC EPIDEMIOLOGY
卷 36, 期 3, 页码 253-262出版社
WILEY-BLACKWELL
DOI: 10.1002/gepi.21618
关键词
evaluation; CNV-calling strategies; family-based GWAS
资金
- Johns Hopkins University (JHU) Center for Inherited Disease Research (CIDR) [HHSN268200782096C]
- NIDCR [R01-DE 014899, R01-DE09551, R01-DE12101]
- Danish National Research Foundation
- Danish Pharmacist's Fund
- Egmont Foundation
- March of Dimes Birth Defects Foundation
- Augustinus Foundation
- Health Fund of the Danish Health Insurance Societies
- [T32MH015169]
- [U01DE018903]
- [U01HG004423]
- [U01HG004446]
A major concern for all copy number variation (CNV) detection algorithms is their reliability and repeatability. However, it is difficult to evaluate the reliability of CNV-calling strategies due to the lack of gold-standard data that would tell us which CNVs are real. We propose that if CNVs are called in duplicate samples, or inherited from parent to child, then these can be considered validated CNVs. We used two large family-based genome-wide association study (GWAS) datasets from the GENEVA consortium to look at concordance rates of CNV calls between duplicate samples, parent-child pairs, and unrelated pairs. Our goal was to make recommendations for ways to filter and use CNV calls in GWAS datasets that do not include family data. We used PennCNV as our primary CNV-calling algorithm, and tested CNV calls using different datasets and marker sets, and with various filters on CNVs and samples. Using the Illumina core HumanHap550 single nucleotide polymorphism (SNP) set, we saw duplicate concordance rates of approximately 55% and parent-child transmission rates of approximately 28% in our datasets. GC model adjustment and sample quality filtering had little effect on these reliability measures. Stratification on CNV size and DNA sample type did have some effect. Overall, our results show that it is probably not possible to find a CNV-calling strategy (including filtering and algorithm) that will give us a set of reliable CNV calls using current chip technologies. But if we understand the error process, we can still use CNV calls appropriately in genetic association studies. Genet. Epidemiol. 36:253-262, 2012. (C) 2012 Wiley Periodicals, Inc.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据