Article

Reference-based phasing using the Haplotype Reference Consortium panel

Journal

NATURE GENETICS
Volume 48, Issue 11, Pages 1443-1448

Publisher

NATURE PORTFOLIO
DOI: 10.1038/ng.3679

Funding

  1. US National Institutes of Health [R01 HG006399, R01 MH101244, HG007022, HL117626]
  2. Wellcome Trust [WT098051]
  3. Austrian Science Fund (FWF) [J-3401]
  4. Fannie and John Hertz Foundation
  5. NCRR [1S10RR028832-01]
  6. Netherlands Scientific Organization [NWO 480-05-003]
  7. [F32 HG007805]
  8. MRC [G0801823] Funding Source: UKRI
  9. Medical Research Council [G0801823, MC_qA137853] Funding Source: researchfish

Haplotype phasing is a fundamental problem in medical and population genetics. Phasing is generally performed via statistical phasing in a genotyped cohort, an approach that can yield high accuracy in very large cohorts but attains lower accuracy in smaller cohorts. Here we instead explore the paradigm of reference-based phasing. We introduce a new phasing algorithm, Eagle2, that attains high accuracy across a broad range of cohort sizes by efficiently leveraging information from large external reference panels (such as the Haplotype Reference Consortium; HRC) using a new data structure based on the positional Burrows-Wheeler transform. We demonstrate that Eagle2 attains a ~20x speedup and ~10% increase in accuracy compared to reference-based phasing using SHAPEIT2. On European-ancestry samples, Eagle2 with the HRC panel achieves >2x the accuracy of 1000 Genomes-based phasing. Eagle2 is open source and freely available for HRC-based phasing via the Sanger Imputation Service and the Michigan Imputation Server.
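
The abstract names the positional Burrows-Wheeler transform (PBWT) without describing it. As a rough illustration only, the sketch below builds the positional prefix arrays of the basic PBWT for a small binary haplotype panel: after each site, haplotypes are reordered so that those sharing the longest match ending at that site sit next to each other. This is not Eagle2's data structure or code; the function name `pbwt_prefix_arrays` and the toy panel are hypothetical, and the abstract does not specify how Eagle2 extends the transform.

```python
from typing import List, Sequence


def pbwt_prefix_arrays(haplotypes: Sequence[Sequence[int]]) -> List[List[int]]:
    """Return the positional prefix arrays a_0 .. a_N for binary haplotypes.

    a_k lists haplotype indices sorted by their reversed prefixes
    (alleles at sites k-1, k-2, ..., 0). Hypothetical helper for
    illustration; not taken from Eagle2.
    """
    n_hap = len(haplotypes)
    n_sites = len(haplotypes[0]) if n_hap else 0

    order = list(range(n_hap))  # a_0: original order, no sites seen yet
    arrays = [order[:]]

    for k in range(n_sites):
        zeros: List[int] = []
        ones: List[int] = []
        # Stable partition by the allele at site k: keeping the previous
        # order within each group preserves the sort on earlier sites.
        for h in order:
            (ones if haplotypes[h][k] else zeros).append(h)
        order = zeros + ones
        arrays.append(order)

    return arrays


# Toy reference panel: 4 haplotypes x 3 biallelic sites (made-up data).
panel = [
    [0, 1, 0],
    [1, 1, 0],
    [0, 0, 1],
    [1, 0, 1],
]
arrays = pbwt_prefix_arrays(panel)
```

Each update is a single stable partition, so building all arrays takes time linear in the number of haplotypes times the number of sites, which is what makes this family of structures attractive for very large reference panels.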
