4.7 Article

A complete pedigree-based graph workflow for rare candidate variant analysis

Journal

GENOME RESEARCH
Volume 32, Issue 5, Pages 893-903

Publisher

COLD SPRING HARBOR LAB PRESS, PUBLICATIONS DEPT
DOI: 10.1101/gr.276387.121

Keywords

-

Funding

  1. Intramural Research Program of the National Human Genome Research Institute
  2. Common Fund, Office of the Director, National Institutes of Health
  3. National Institutes of Health [U41HG010972, R01HG010485, U01HG010961, OT3HL142481, OT2OD026682, U01HL137183, 2U41HG007234]

Ask authors/readers for more resources

This study introduces a pedigree-aware workflow based on pangenome graphs to improve the accuracy of genome mapping and variant calling. The workflow shows significant improvements in single-nucleotide variants and insertion/deletion variants compared to linear-reference mapping and pangenome graph mapping. Additionally, the study adapts and upgrades deleterious-variant detecting methods for streamlined application in undiagnosed diseases.
Methods that use a linear genome reference for genome sequencing data analysis are reference-biased. In the field of clinical genetics for rare diseases, a resulting reduction in genotyping accuracy in some regions has likely prevented the resolution of some cases. Pangenome graphs embed population variation into a reference structure. Although pangenome graphs have helped to reduce reference mapping bias, further performance improvements are possible. We introduce VG-Pedigree, a pedigree-aware workflow based on the pangenome-mapping tool of Giraffe and the variant calling tool DeepTrio using a specially trained model for Giraffe-based alignments. We demonstrate mapping and variant calling improvements in both single-nucleotide variants (SNVs) and insertion and deletion (indel) variants over those produced by alignments created using BWA-MEM to a linear-reference and Giraffe mapping to a pangenome graph containing data from the 1000 Genomes Project. We have also adapted and upgraded deleterious-variant (DV) detecting methods and programs into a streamlined workflow. We used these workflows in combination to detect small lists of candidate DVs among 15 family quartets and quintets of the Undiagnosed Diseases Program (UDP). All candidate DVs that were previously diagnosed using the Mendelian models covered by the previously published methods were recapitulated by these workflows. The results of these experiments indicate that a slightly greater absolute count of DVs are detected in the proband population than in their matched unaffected siblings.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available