☆ 4.6 Article

A combined long-range phasing and long haplotype imputation method to impute phase for SNP genotypes

GENETICS SELECTION EVOLUTION (2011)

期刊

GENETICS SELECTION EVOLUTION

卷 43, 期 -, 页码 -

出版社

BIOMED CENTRAL LTD

DOI: 10.1186/1297-9686-43-12

关键词

类别

Agriculture, Dairy & Animal Science Genetics & Heredity

资金

Australian Research Council [LP100100880]
Australian Research Council [LP100100880] Funding Source: Australian Research Council
Chief Scientist Office [CZB/4/710] Funding Source: researchfish

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Background: Knowing the phase of marker genotype data can be useful in genome-wide association studies, because it makes it possible to use analysis frameworks that account for identity by descent or parent of origin of alleles and it can lead to a large increase in data quantities via genotype or sequence imputation. Long-range phasing and haplotype library imputation constitute a fast and accurate method to impute phase for SNP data. Methods: A long-range phasing and haplotype library imputation algorithm was developed. It combines information from surrogate parents and long haplotypes to resolve phase in a manner that is not dependent on the family structure of a dataset or on the presence of pedigree information. Results: The algorithm performed well in both simulated and real livestock and human datasets in terms of both phasing accuracy and computation efficiency. The percentage of alleles that could be phased in both simulated and real datasets of varying size generally exceeded 98% while the percentage of alleles incorrectly phased in simulated data was generally less than 0.5%. The accuracy of phasing was affected by dataset size, with lower accuracy for dataset sizes less than 1000, but was not affected by effective population size, family data structure, presence or absence of pedigree information, and SNP density. The method was computationally fast. In comparison to a commonly used statistical method (fastPHASE), the current method made about 8% less phasing mistakes and ran about 26 times faster for a small dataset. For larger datasets, the differences in computational time are expected to be even greater. A computer program implementing these methods has been made available. Conclusions: The algorithm and software developed in this study make feasible the routine phasing of high-density SNP chips in large datasets.

A combined long-range phasing and long haplotype imputation method to impute phase for SNP genotypes

期刊

GENETICS SELECTION EVOLUTION

出版社

BIOMED CENTRAL LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

A combined long-range phasing and long haplotype imputation method to impute phase for SNP genotypes

期刊

GENETICS SELECTION EVOLUTION

出版社

BIOMED CENTRAL LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文