4.8 Article

Complete genomic and transcriptional landscape analysis using third-generation sequencing: a case study of Saccharomyces cerevisiae CEN.PK113-7D

Journal

NUCLEIC ACIDS RESEARCH
Volume 46, Issue 7, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/nar/gky014

Keywords

-

Funding

  1. Arkansas Research Alliance
  2. Helen Adams AMP
  3. Arkansas Research Alliance Professor Chair
  4. NIH/NIGMS [1P20GM121293]
  5. Knut and Alice Wallenberg Foundation
  6. Novo Nordisk Foundation
  7. UAMS startup fund
  8. NNF Center for Biosustainability [Yeast Cell Factories] Funding Source: researchfish
  9. Novo Nordisk Fonden [NNF10CC1016517] Funding Source: researchfish

Completion of eukaryal genomes can be a difficult task with the highly repetitive sequences along the chromosomes and short read lengths of second-generation sequencing. Saccharomyces cerevisiae strain CEN.PK113-7D, widely used as a model organism and a cell factory, was selected for this study to demonstrate the superior capability of very long sequence reads for de novo genome assembly. We generated long reads using two common third-generation sequencing technologies (Oxford Nanopore Technology (ONT) and Pacific Biosciences (PacBio)) and used short reads obtained using Illumina sequencing for error correction. Assembly of the reads derived from all three technologies resulted in complete sequences for all 16 yeast chromosomes, as well as the mitochondrial chromosome, in one step. Further, we identified three types of DNA methylation (5mC, 4mC and 6mA). Comparison between the reference strain S288C and strain CEN.PK113-7D identified chromosomal rearrangements against a background of similar gene content between the two strains. We identified full-length transcripts through ONT direct RNA sequencing technology. This allows for the identification of transcriptional landscapes, including untranslated regions (UTRs) (5' UTR and 3' UTR) as well as differential gene expression quantification. About 91% of the predicted transcripts could be consistently detected across biological replicates grown either on glucose or ethanol. Direct RNA sequencing identified many polyadenylated non-coding RNAs, rRNAs, telomere-RNA, long non-coding RNA and antisense RNA. This work demonstrates a strategy to obtain complete genome sequences and transcriptional landscapes that can be applied to other eukaryal organisms.
