4.8 Article

Improved genome inference in the MHC using a population reference graph

Journal

NATURE GENETICS
Volume 47, Issue 6, Pages 682-688

Publisher

NATURE PORTFOLIO
DOI: 10.1038/ng.3257

Keywords

-

Funding

  1. GlaxoSmithKline
  2. Wellcome Trust [100956/Z/13/Z, 102541/Z/13/Z]
  3. Nuffield Department of Medicine Fellowship
  4. Royal Society [102541/Z/13/Z]
  5. Wellcome Trust [102541/Z/13/Z] Funding Source: Wellcome Trust

Ask authors/readers for more resources

Although much is known about human genetic variation, such information is typically ignored in assembling new genomes. Instead, reads are mapped to a single reference, which can lead to poor characterization of regions of high sequence or structural diversity. We introduce a population reference graph, which combines multiple reference sequences and catalogs of variation. The genomes of new samples are reconstructed as paths through the graph using an efficient hidden Markov model, allowing for recombination between different haplotypes and additional variants. By applying the method to the 4.5-Mb extended MHC region on human chromosome 6, combining 8 assembled haplotypes, the sequences of known classical HLA alleles and 87,640 SNP variants from the 1000 Genomes Project, we demonstrate using simulations, SNP genotyping, and short-read and long-read data how the method improves the accuracy of genome inference and identified regions where the current set of reference sequences is substantially incomplete.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available