4.7 Article

LongAGE: defining breakpoints of genomic structural variants through optimal and memory efficient alignments of long reads

Journal

BIOINFORMATICS
Volume 37, Issue 7, Pages 1015-1017

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btaa703

Keywords

-

Funding

  1. National Institute of Health [U24CA220242]

Ask authors/readers for more resources

The study found that different breakpoints for deletions and duplications at the same locus provide direct evidence that these multi-allelic copy number variants arise from multiple independent ancestral mutations. To address the problem of large gaps in alignment affecting single-nucleotide breakpoint resolution, a memory-efficient implementation called LongAGE was developed.
Defining the precise location of structural variations (SVs) at single-nucleotide breakpoint resolution is a challenging problem due to large gaps in alignment. Previously, Alignment with Gap Excision (AGE) enabled us to define breakpoints of SVs at single-nucleotide resolution; however, AGE requires a vast amount of memory when aligning a pair of long sequences. To address this, we developed a memory-efficient implementation-LongAGE-based on the classical Hirschberg algorithm. We demonstrate an application of LongAGE for resolving breakpoints of SVs embedded into segmental duplications on Pacific Biosciences (PacBio) reads that can be longer than 10 kb. Furthermore, we observed different breakpoints for a deletion and a duplication in the same locus, providing direct evidence that such multi-allelic copy number variants (mCNVs) arise from two or more independent ancestral mutations.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available