4.5 Article

Long-read transcriptome sequencing reveals abundant promoter diversity in distinct molecular subtypes of gastric cancer

期刊

GENOME BIOLOGY
卷 22, 期 1, 页码 -

出版社

BMC
DOI: 10.1186/s13059-021-02261-x

关键词

Gastric cancer; Alternative splicing; Alternative promoter; Iso-seq

资金

  1. Kyowa Hakko Kirin CO., LTD. [2016-1700]
  2. Khoo Postdoctoral Fellowship Award [Duke-NUS-KPFA/2019/0031]
  3. National Medical Research Council [NMRC/STaR/0026/2015, OF-LCG18May-0023]
  4. National Research Foundation Singapore
  5. Singapore Ministry of Education under its Research Centres of Excellence initiative

向作者/读者索取更多资源

In this study, a comprehensive survey of the transcriptome of gastric cancer was conducted using a combination of long-read and short-read RNA sequencing technologies. The results revealed a large number of novel transcripts, which are more likely to be cell line and subtype specific, expressed at lower levels, with larger number of exons, and longer sequence lengths. The study also identified cancer-associated isoforms and potential changes in coding sequences, providing valuable insight for deeper studies of gastrointestinal malignancies.
Background Deregulated gene expression is a hallmark of cancer; however, most studies to date have analyzed short-read RNA sequencing data with inherent limitations. Here, we combine PacBio long-read isoform sequencing (Iso-Seq) and Illumina paired-end short-read RNA sequencing to comprehensively survey the transcriptome of gastric cancer (GC), a leading cause of global cancer mortality. Results We performed full-length transcriptome analysis across 10 GC cell lines covering four major GC molecular subtypes (chromosomal unstable, Epstein-Barr positive, genome stable and microsatellite unstable). We identify 60,239 non-redundant full-length transcripts, of which > 66% are novel compared to current transcriptome databases. Novel isoforms are more likely to be cell line and subtype specific, expressed at lower levels with larger number of exons, with longer isoform/coding sequence lengths. Most novel isoforms utilize an alternate first exon, and compared to other alternative splicing categories, are expressed at higher levels and exhibit higher variability. Collectively, we observe alternate promoter usage in 25% of detected genes, with the majority (84.2%) of known/novel promoter pairs exhibiting potential changes in their coding sequences. Mapping these alternate promoters to TCGA GC samples, we identify several cancer-associated isoforms, including novel variants of oncogenes. Tumor-specific transcript isoforms tend to alter protein coding sequences to a larger extent than other isoforms. Analysis of outcome data suggests that novel isoforms may impart additional prognostic information. Conclusions Our results provide a rich resource of full-length transcriptome data for deeper studies of GC and other gastrointestinal malignancies.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据