Article

Learning the Sequence Determinants of Alternative Splicing from Millions of Random Sequences

Journal

CELL
Volume 163, Issue 3, Pages 698-711

Publisher

CELL PRESS
DOI: 10.1016/j.cell.2015.09.054

Funding

  1. National Science Foundation (NSF) [0954566]
  2. Burroughs Wellcome Career Award at the Scientific Interface
  3. National Science Foundation, Directorate for Computer & Information Science & Engineering, Division of Computing and Communication Foundations [1317694]
  4. National Science Foundation, Directorate for Engineering, Division of Chemical, Bioengineering, Environmental, and Transport Systems [0954566]

Most human transcripts are alternatively spliced, and many disease-causing mutations affect RNA splicing. Toward better modeling the sequence determinants of alternative splicing, we measured the splicing patterns of over two million (M) synthetic mini-genes, which include degenerate subsequences totaling over 100 M bases of variation. The massive size of these training data allowed us to improve upon current models of splicing, as well as to gain new mechanistic insights. Our results show that the vast majority of hexamer sequence motifs measurably influence splice site selection when positioned within alternative exons, with multiple motifs acting additively rather than cooperatively. Intriguingly, motifs that enhance (suppress) exon inclusion in alternative 5′ splicing also enhance (suppress) exon inclusion in alternative 3′ or cassette exon splicing, suggesting a universal mechanism for alternative exon recognition. Finally, our empirically trained models are highly predictive of the effects of naturally occurring variants on alternative splicing in vivo.
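
To make the additivity claim concrete, the following is a minimal sketch of what an additive hexamer model could look like. It is an illustration only and not the authors' published model: the function names, the toy weights, and the logistic link that maps a score to an inclusion probability are all assumptions introduced here. Each hexamer occurrence contributes a fixed weight to the score, independent of which other hexamers are present.

```python
from collections import Counter
from math import exp


def count_hexamers(seq, k=6):
    """Count every overlapping k-mer (hexamer by default) in a sequence."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def additive_score(seq, weights, bias=0.0):
    """Sum per-hexamer weights over all occurrences; no interaction terms.

    `weights` is a hypothetical dict of hexamer -> effect size. Hexamers
    missing from the dict are treated as neutral (weight 0).
    """
    counts = count_hexamers(seq)
    return bias + sum(weights.get(hexamer, 0.0) * n for hexamer, n in counts.items())


def inclusion_probability(seq, weights, bias=0.0):
    """Map the additive score to (0, 1) with a logistic link (an assumption)."""
    return 1.0 / (1.0 + exp(-additive_score(seq, weights, bias)))


# Toy weights, made up for illustration; they are not values from the paper.
toy_weights = {"GAAGAA": 1.5, "TAGGGT": -0.5}

enhancer_only = additive_score("GAAGAA", toy_weights)
silencer_only = additive_score("TAGGGT", toy_weights)
both = additive_score("GAAGAATTTAGGGT", toy_weights)
```

In this sketch, the score of a sequence carrying both motifs equals the sum of the scores of each motif alone, which is what "additively rather than cooperatively" means; a cooperative model would need extra terms for pairs of motifs.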
