4.8 Article

Telomere-to-telomere assembly of a complete human X chromosome

Journal

NATURE
Volume 585, Issue 7823, Pages 79-+

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41586-020-2547-7

Keywords

-

Funding

  1. NIH/NHGRI [R21 1R21HG010548-01, U01 1U01HG010971, U54 1U54HG007990, R01 HG009190, 2R44HG008118]
  2. Intramural Research Program of the National Human Genome Research Institute, National Institutes of Health
  3. Korea Health Technology R&D Project through the Korea Health Industry Development Institute [HI17C2098]
  4. Intramural Research Program of the National Library of Medicine, National Institutes of Health
  5. Common Fund, Office of the Director, NIH
  6. Stowers Institute for Medical Research
  7. NIH [R01 GM124041, HG002385, HG010169, 1F32GM134558-01, R01CA181308]
  8. National Library of Medicine Big Data Training Grant for Genomics and Neuroscience [5T32LM012419-04]
  9. W. M. Keck Foundation [DT06172015]
  10. NIH/NHLBI [U01 1U01HL137183]
  11. NIH/NHGRI/EMBL [2U41HG007234]
  12. NIGMS [T32 GM007445]
  13. Wellcome Trust [212965/Z/18/Z]
  14. National Institute for Health Research (NIHR) Surgical Reconstruction and Microbiology Research Centre (SRMRC)
  15. NATIONAL HUMAN GENOME RESEARCH INSTITUTE [ZIAHG200398, ZIAHG200330, ZIBHG000196, ZICHG200348] Funding Source: NIH RePORTER
  16. NATIONAL LIBRARY OF MEDICINE [ZIHLM200888] Funding Source: NIH RePORTER
  17. MRC [MR/J014370/1] Funding Source: UKRI
  18. UKRI [MR/S035362/1] Funding Source: UKRI
  19. Wellcome Trust [212965/Z/18/Z] Funding Source: Wellcome Trust

Ask authors/readers for more resources

After two decades of improvements, the current human reference genome (GRCh38) is the most accurate and complete vertebrate genome ever produced. However, no single chromosome has been finished end to end, and hundreds of unresolved gaps persist(1,2). Here we present a human genome assembly that surpasses the continuity of GRCh38(2), along with a gapless, telomere-to-telomere assembly of a human chromosome. This was enabled by high-coverage, ultra-long-read nanopore sequencing of the complete hydatidiform mole CHM13 genome, combined with complementary technologies for quality improvement and validation. Focusing our efforts on the human X chromosome(3), we reconstructed the centromeric satellite DNA array (approximately 3.1 Mb) and closed the 29 remaining gaps in the current reference, including new sequences from the human pseudoautosomal regions and from cancer-testis ampliconic gene families (CT-X and GAGE). These sequences will be integrated into future human reference genome releases. In addition, the complete chromosome X, combined with the ultra-long nanopore data, allowed us to map methylation patterns across complex tandem repeats and satellite arrays. Our results demonstrate that finishing the entire human genome is now within reach, and the data presented here will facilitate ongoing efforts to complete the other human chromosomes. High-coverage, ultra-long-read nanopore sequencing is used to create a new human genome assembly that improves on the coverage and accuracy of the current reference (GRCh38) and includes the gap-free, telomere-to-telomere sequence of the X chromosome.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available