Journal
CELL
Volume 176, Issue 3, Pages 663-+Publisher
CELL PRESS
DOI: 10.1016/j.cell.2018.12.019
Keywords
-
Categories
Funding
- U.S. National Institutes of Health (NIH) [HG002385, HG007635, HG009081, HG003079, HG009478]
- National Health and Medical Research Council (NHMRC) CJ Martin Biomedical Fellowship [1073726]
- U.S. National Institutes of Health [T32 HG000035-23]
- National Health and Medical Research Council of Australia [1073726] Funding Source: NHMRC
Ask authors/readers for more resources
In order to provide a comprehensive resource for human structural variants (SVs), we generated long-read sequence data and analyzed SVs for fifteen human genomes. We sequence resolved 99,604 insertions, deletions, and inversions including 2,238 (1.6 Mbp) that are shared among all discovery genomes with an additional 13,053 (6.9 Mbp) present in the majority, indicating minor alleles or errors in the reference. Genotyping in 440 additional genomes confirms the most common SVs in unique euchromatin are now sequence resolved. We report a ninefold SV bias toward the last 5 Mbp of human chromosomes with nearly 55% of all VNTRs (variable number of tandem repeats) mapping to this portion of the genome. We identify SVs affecting coding and non-coding regulatory loci improving annotation and interpretation of functional variation. These data provide the framework to construct a canonical human reference and a resource for developing advanced representations capable of capturing allelic diversity.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available