A New Statistic to Evaluate Imputation Reliability

PLOS ONE (2010)

Journal

PLOS ONE

Volume 5, Issue 3, Pages -

Publisher

PUBLIC LIBRARY SCIENCE

DOI: 10.1371/journal.pone.0009697

Category

Multidisciplinary Sciences

Funding

NIH Genes, Environment and Health Initiative [GEI] [U01 HG004422, U01HG004438]
Gene Environment Association Studies (GENEVA) under GEI
GENEVA Coordinating Center [U01 HG004446]
Collaborative Study on the Genetics of Alcoholism [COGA; U10 AA008401]
Collaborative Genetic Study of Nicotine Dependence [COGEND; P01 CA089392]
Family Study of Cocaine Dependence [FSCD; R01 DA013423]
National Institute on Alcohol Abuse and Alcoholism
National Institute on Drug Abuse
NIH [HHSN268200782096C]
National Institute of Mental Health Center [550K]
Pritzker Neuropsychiatric Disorders Research Fund
U.S. Public Health Service (USPHS) [U24MH68457, HHSN271200477471C, P01CA089392]
[K01DA024722]

Abstract

Background: As the amount of data from genome-wide association studies grows dramatically, many interesting scientific questions require imputation to combine or expand datasets. However, there are two situations for which imputation has been problematic: (1) polymorphisms with low minor allele frequency (MAF), and (2) datasets where subjects are genotyped on different platforms. Traditional measures of imputation cannot effectively address these problems.

Methodology/Principal Findings: We introduce a new statistic, the imputation quality score (IQS). In order to differentiate between well-imputed and poorly-imputed single nucleotide polymorphisms (SNPs), IQS adjusts the concordance between imputed and genotyped SNPs for chance. We first evaluated IQS in relation to minor allele frequency. Using a sample of subjects genotyped on the Illumina 1M array, we extracted those SNPs that were also on the Illumina 550K array and imputed them to the full set of the 1M SNPs. As expected, the average IQS value drops dramatically with a decrease in minor allele frequency, indicating that IQS appropriately adjusts for minor allele frequency. We then evaluated whether IQS can filter poorly-imputed SNPs in situations where cases and controls are genotyped on different platforms. Randomly dividing the data into "cases" and "controls", we extracted the Illumina 550K SNPs from the cases and imputed the remaining Illumina 1M SNPs. The initial Q-Q plot for the test of association between cases and controls was grossly distorted (lambda = 1.15) and had 4016 false positives, reflecting imputation error. After filtering out SNPs with IQS < 0.9, the Q-Q plot was acceptable and there were no longer any false positives. We then evaluated the robustness of IQS computed independently on the two halves of the data. In both European Americans and African Americans the correlation was > 0.99, demonstrating that a database of IQS values from common imputations could be used as an effective filter to combine data genotyped on different platforms.

Conclusions/Significance: IQS effectively differentiates well-imputed and poorly-imputed SNPs. It is particularly useful for SNPs with low minor allele frequency and when datasets are genotyped on different platforms.
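
To illustrate what "adjusting concordance for chance" can look like, here is a minimal Python sketch. The abstract does not state the IQS formula, so the code assumes a kappa-style adjustment, (observed agreement - chance agreement) / (1 - chance agreement). The 0/1/2 genotype coding and the function names (`chance_adjusted_concordance`, `filter_snps`) are illustrative assumptions and are not taken from the paper.

```python
from collections import Counter

# Assumption: genotypes are coded as minor-allele counts 0, 1, 2.
GENOTYPES = (0, 1, 2)


def observed_concordance(true_calls, imputed_calls):
    """Fraction of subjects whose imputed genotype equals the genotyped one."""
    matches = sum(t == i for t, i in zip(true_calls, imputed_calls))
    return matches / len(true_calls)


def chance_concordance(true_calls, imputed_calls):
    """Agreement expected by chance, from the marginal genotype frequencies."""
    n = len(true_calls)
    true_freq = Counter(true_calls)
    imputed_freq = Counter(imputed_calls)
    return sum((true_freq[g] / n) * (imputed_freq[g] / n) for g in GENOTYPES)


def chance_adjusted_concordance(true_calls, imputed_calls):
    """Kappa-style score (assumed form): (p_o - p_c) / (1 - p_c)."""
    if not true_calls or len(true_calls) != len(imputed_calls):
        raise ValueError("call lists must be non-empty and of equal length")
    p_o = observed_concordance(true_calls, imputed_calls)
    p_c = chance_concordance(true_calls, imputed_calls)
    if p_c == 1.0:
        # Both call sets are monomorphic for the same genotype: undefined.
        return float("nan")
    return (p_o - p_c) / (1.0 - p_c)


def filter_snps(scores, threshold=0.9):
    """Keep SNPs scoring at or above the threshold (0.9 is the abstract's cutoff)."""
    return {snp: s for snp, s in scores.items() if s >= threshold}


# A rare SNP: 2 of 100 subjects carry the minor allele, but imputation
# calls everyone homozygous for the major allele.
rare_true = [0] * 98 + [1] * 2
rare_imputed = [0] * 100

# A common SNP imputed without error.
common_true = [0, 0, 1, 2] * 25
common_imputed = list(common_true)

scores = {
    "rare_snp": chance_adjusted_concordance(rare_true, rare_imputed),
    "common_snp": chance_adjusted_concordance(common_true, common_imputed),
}
kept = filter_snps(scores)
```

In this toy data the rare SNP has a raw concordance of 0.98, yet its chance-adjusted score is 0 because always calling the major genotype agrees just as often by chance. That is the low-MAF problem the abstract describes, and only the common SNP passes the 0.9 filter.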
