4.5 Article

Assessing reproducibility of inherited variants detected with short-read whole genome sequencing

Journal

GENOME BIOLOGY
Volume 23, Issue 1, Pages -

Publisher

BMC
DOI: 10.1186/s13059-021-02569-8

Keywords

-

Funding

  1. National Key R&D Project of China [2018YFE0201603, 2018YFE0201600, 2017YFF0204600]
  2. National Natural Science Foundation of China [31720103909, 32170657]
  3. Shanghai Municipal Science and Technology Major Project [2017SHZDZX01]
  4. State Key Laboratory of Genetic Engineering [SKLGE-2117]
  5. 111 Project [B13016]
  6. Intramural Research Program of the National Library of Medicine, National Institutes of Health
  7. JSPS KAKENHI [17H00789]
  8. Open Research Fund of Key Laboratory of Advanced Theory and Application in Statistics and Data Science -MOE
  9. ECNU
  10. NIH [UM1 HG008898]
  11. Grants-in-Aid for Scientific Research [17H00789] Funding Source: KAKEN

Ask authors/readers for more resources

This study systematically evaluates the reproducibility of inherited variants with whole genome sequencing (WGS) and identifies the impact of different factors on variant call quality. The findings show that bioinformatics pipelines have a larger impact on reproducibility than WGS platform or library preparation. Single-nucleotide variants (SNVs) are more reproducible than small insertions and deletions (indels), and increasing sequencing coverage improves indel reproducibility but has limited impact on SNVs above 30x.
Background: Reproducible detection of inherited variants with whole genome sequencing (WGS) is vital for the implementation of precision medicine and is a complicated process in which each step affects variant call quality. Systematically assessing reproducibility of inherited variants with WGS and impact of each step in the process is needed for understanding and improving quality of inherited variants from WGS. Results: To dissect the impact of factors involved in detection of inherited variants with WGS, we sequence triplicates of eight DNA samples representing two populations on three short-read sequencing platforms using three library kits in six labs and call variants with 56 combinations of aligners and callers. We find that bioinformatics pipelines (callers and aligners) have a larger impact on variant reproducibility than WGS platform or library preparation. Single-nucleotide variants (SNVs), particularly outside difficult-to-map regions, are more reproducible than small insertions and deletions (indels), which are least reproducible when > 5 bp. Increasing sequencing coverage improves indel reproducibility but has limited impact on SNVs above 30x. Conclusions: Our findings highlight sources of variability in variant detection and the need for improvement of bioinformatics pipelines in the era of precision medicine with WGS.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available