4.7 Article

Large-scale analysis of branchpoint usage across species and cell lines

期刊

GENOME RESEARCH
卷 27, 期 4, 页码 639-649

出版社

COLD SPRING HARBOR LAB PRESS, PUBLICATIONS DEPT
DOI: 10.1101/gr.202820.115

关键词

-

资金

  1. National Institutes of Health
  2. National Institute of General Medical Sciences (NIGMS) [P30GM103410]
  3. NCRR [P30RR031153, P20RR018728, S10RR02763]
  4. National Science Foundation (EPSCoR) [0554548]
  5. Lifespan Rhode Island Hospital
  6. Division of Biology and Medicine, Brown University
  7. National Institutes of Health [R01GR527276]
  8. NIGMS [R01GM105681]

向作者/读者索取更多资源

The coding sequence of each human pre-mRNA is interrupted, on average, by 11 introns that must be spliced out for proper gene expression. Each intron contains three obligate signals: a 5' splice site, a branch site, and a 3' splice site. Splice site usage has been mapped exhaustively across different species, cell types, and cellular states. In contrast, only a small fraction of branch sites have been identified even once. The few reported annotations of branch site are imprecise as reverse transcriptase skips several nucleotides while traversing a 2-5 linkage. Here, we report large-scale mapping of the branchpoints from deep sequencing data in three different species and in the SF3B1 K700E oncogenic mutant background. We have developed a novel method whereby raw lariat reads are refined by U2snRNP/pre-mRNA base-pairing models to return the largest current data set of branchpoint sequences with quality metrics. This analysis discovers novel modes of U2snRNA:pre-mRNA base-pairing conserved in yeast and provides insight into the biogenesis of intron circles. Finally, matching branch site usage with isoform selection across the extensive panel of ENCODE RNA-seq data sets offers insight into the mechanisms by which branchpoint usage drives alternative splicing.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据