4.7 Article

WgLink: reconstructing whole-genome viral haplotypes using L-0+L-1-regularization

Journal

BIOINFORMATICS
Volume 37, Issue 17, Pages 2744-2746

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btab076

Keywords

-

Funding

  1. National Science and Engineering Research Council (NSERC)
  2. Canada Foundation for Innovation
  3. Genome Canada/Genome Alberta
  4. Alberta Children's Hospital Research Institute Postdoctoral Fellowship

Ask authors/readers for more resources

WgLink is a tool that can reconstruct whole viral genomes with high accuracy, consuming significantly less memory and fewer CPU hours compared to existing methods.
Many tools can reconstruct viral sequences based on next-generation sequencing reads. Although existing tools effectively recover local regions, their accuracy suffers when reconstructing the whole viral genomes (strains). Moreover, they consume significant memory when the sequencing coverage is high or when the genome size is large. We present WgLink to meet this challenge. WgLink takes local reconstructions produced by other tools as input and patches the resulting segments together into coherent whole-genome strains. We accomplish this using an L-0+L-1-regularized regression, synthesizing variant allele frequency data with physical linkage between multiple variants spanning multiple regions simultaneously. WgLink achieves higher accuracy than existing tools both on simulated and on real datasets while using significantly less memory (RAM) and fewer CPU hours.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available