4.7 Article

Haplotype diversity and sequence heterogeneity of human telomeres

期刊

GENOME RESEARCH
卷 31, 期 7, 页码 -

出版社

COLD SPRING HARBOR LAB PRESS, PUBLICATIONS DEPT
DOI: 10.1101/gr.274639.120

关键词

-

资金

  1. STARR [I9A9071, I130052]
  2. NASA [NNX14AH51G, NNX14AB02G, NNX17AB26G]
  3. National Institutes of Health [R01MH117406, R01NS076465, R01CA249054, R01AI151059, P01HD067244, P01CA214274]
  4. Translational Research Institute for Space Health [NNX16AO69A:0107, NNX16AO69A:0061]
  5. Leukemia and Lymphoma Society (LLS) [923816, MCL982]
  6. National Science Foundation [1840275]
  7. NASA [686935, 683145, NNX14AB02G, NNX14AH51G] Funding Source: Federal RePORTER
  8. Direct For Computer & Info Scie & Enginr
  9. Div Of Information & Intelligent Systems [1840275] Funding Source: National Science Foundation

向作者/读者索取更多资源

Long-read sequencing technology enables accurate capture of long, complex telomeric stretches and repeats, revealing sequence heterogeneity in human telomeres and genetic inheritance patterns of telomeric variants across different ancestries.
Telomeres are regions of repetitive nucleotide sequences capping the ends of eukaryotic chromosomes that protect against deterioration, and whose lengths can be correlated with age and adverse health risk factors. Yet, given their length and repetitive nature, telomeric regions are not easily reconstructed from short-read sequencing, thus making telomere sequencing, mapping, and variant resolution challenging problems. Recently, long-read sequencing, with read lengths measuring in hundreds of kilobase pairs, has made it possible to routinely read into telomeric regions and inspect their sequence structure. Here, we describe a framework for extracting telomeric reads from whole-genome single-molecule sequencing experiments, including de novo identification of telomere repeat motifs and repeat types, and also describe their sequence variation. We find that long, complex telomeric stretches and repeats can be accurately captured with long-read sequencing, observe extensive sequence heterogeneity of human telomeres, discover and localize noncanonical telomere sequence motifs (both previously reported, as well as novel), and validate them in short-read sequence data. These data reveal extensive intra-and inter-population diversity of repeats in telomeric haplotypes, reveal higher paternal inheritance of telomeric variants, and represent the first motif composition maps of multi-kilobase-pair human telomeric haplotypes across three distinct ancestries (Ashkenazi, Chinese, and Utah), which can aid in future studies of genetic variation, aging, and genome biology.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据