4.3 Article

Optimal two-stage genotyping designs for genome-wide association scans

期刊

GENETIC EPIDEMIOLOGY
卷 30, 期 4, 页码 356-368

出版社

WILEY
DOI: 10.1002/gepi.20150

关键词

sNPs; experimental design; whole genome association studies; multiple comparisons

资金

  1. NCI NIH HHS [P01 CA 17054-27A2, CA63464] Funding Source: Medline
  2. NHGRI NIH HHS [HG002790] Funding Source: Medline
  3. NIEHS NIH HHS [5P30 ES07048] Funding Source: Medline
  4. NIGMS NIH HHS [GM58897] Funding Source: Medline
  5. Direct For Computer & Info Scie & Enginr
  6. Division of Computing and Communication Foundations [0829882] Funding Source: National Science Foundation

向作者/读者索取更多资源

The much-anticipated fixed-array, genome-wide SNP genotyping technologies make large-scale genome-wide association scans now possible for large numbers of subjects. In this paper we reconsider the problem (Satagopan and Elston [20031 Genet Epidemiol 25:149-157) of optimizing a two-stage genotyping design to deal with important new issues that are relevant when studies are expanded from candidate gene size to a genome-wide scale. We investigate how the basic two-stage genotyping approach, in which all markers are genotyped in an initial group of subjects (stage 1) and only the promising markers are genotyped in additional subjects (stage 11), can be used to reduce genotyping cost in a genome-wide case-control association study even after allowing for much higher per genotype costs using specially designed assays in stage 11, compared to the fixed array of SNPs used in stage I. In addition, we consider the problem of using measured SNPs to make (imperfect) prediction of unmeasured SNPs for association tests of all SNPs (measured or unmeasured) genome wide and the utility of expanding genotyping densities in stage 11 in the regions where significant associations were detected in stage I. Under a set of reasonable but conservative assumptions, we derive optimal two-stage design configurations (sample sizes and the thresholds of significance in both stages) with these optimal designs depending both on the total number of markers tested and upon the ratios of cost in stage 11 versus stage I. In addition we show how existing software for power and sample size calculations can be used for the purpose of designing two-stage studies, for a wide range of assumptions about the number of markers genotyped and the costs of genotyping in each stage of the study.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据