4.6 Article

Commonalities across computational workflows for uncovering explanatory variants in undiagnosed cases

Journal

GENETICS IN MEDICINE
Volume 23, Issue 6, Pages 1075-1085

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1038/s41436-020-01084-8

Keywords

-

Funding

  1. NIH Common Fund, through the Office of Strategic Coordination/Office of the NIH [U01HG007530, U01HG007942, U01HG007672, U01HG007690, U01HG010218, U01HG007703, U01HG010230, U01HG010217, U01HG010233, U01HG007674, U01HG010215]
  2. Intramural Research Program of the National Human Genome Research Institute

Ask authors/readers for more resources

Genomic sequencing has become a powerful tool for discovering genetic abnormalities underlying rare Mendelian conditions. This study found that while well-established practices exist for initial steps in variant calling and quality control, there is substantial divergence in later stages for variant prioritization and multimodal data integration across different sites. The differences suggest that advancements in structural variant detection, noncoding variant interpretation, and integration of additional biomedical data could be especially promising for solving chronically undiagnosed cases.
Purpose Genomic sequencing has become an increasingly powerful and relevant tool to be leveraged for the discovery of genetic aberrations underlying rare, Mendelian conditions. Although the computational tools incorporated into diagnostic workflows for this task are continually evolving and improving, we nevertheless sought to investigate commonalities across sequencing processing workflows to reveal consensus and standard practice tools and highlight exploratory analyses where technical and theoretical method improvements would be most impactful. Methods We collected details regarding the computational approaches used by a genetic testing laboratory and 11 clinical research sites in the United States participating in the Undiagnosed Diseases Network via meetings with bioinformaticians, online survey forms, and analyses of internal protocols. Results We found that tools for processing genomic sequencing data can be grouped into four distinct categories. Whereas well-established practices exist for initial variant calling and quality control steps, there is substantial divergence across sites in later stages for variant prioritization and multimodal data integration, demonstrating a diversity of approaches for solving the most mysterious undiagnosed cases. Conclusion The largest differences across diagnostic workflows suggest that advances in structural variant detection, noncoding variant interpretation, and integration of additional biomedical data may be especially promising for solving chronically undiagnosed cases.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available