4.8 Article

Long-read sequencing and de novo assembly of a Chinese genome

期刊

NATURE COMMUNICATIONS
卷 7, 期 -, 页码 -

出版社

NATURE PORTFOLIO
DOI: 10.1038/ncomms12065

关键词

-

资金

  1. NIH [HG006465, MH108728, HG007635, HG002385]
  2. National Natural Science Foundation of China [31400922]
  3. Program of Introducing Talents of Discipline to Universities [B14036]
  4. Leading Talents of Guangdong

向作者/读者索取更多资源

Short-read sequencing has enabled the de novo assembly of several individual human genomes, but with inherent limitations in characterizing repeat elements. Here we sequence a Chinese individual HX1 by single-molecule real-time (SMRT) long-read sequencing, construct a physical map by NanoChannel arrays and generate a de novo assembly of 2.93 Gb (contig N50: 8.3 Mb, scaffold N50: 22.0 Mb, including 39.3Mb N-bases), together with 206Mb of alternative haplotypes. The assembly fully or partially fills 274 (28.4%) N-gaps in the reference genome GRCh38. Comparison to GRCh38 reveals 12.8Mb of HX1-specific sequences, including 4.1Mb that are not present in previously reported Asian genomes. Furthermore, long-read sequencing of the transcriptome reveals novel spliced genes that are not annotated in GENCODE and are missed by short-read RNA-Seq. Our results imply that improved characterization of genome functional variation may require the use of a range of genomic technologies on diverse human populations.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据