4.8 Article

Unconstrained mining of transcript data reveals increased alternative splicing complexity in the human transcriptome

Journal

NUCLEIC ACIDS RESEARCH
Volume 38, Issue 14, Pages 4740-4754

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/nar/gkq197

Keywords

-

Funding

  1. Muscular Dystrophy Association [MDA3662]
  2. European Commission [LSHG-CT-2005-518238]
  3. 'Fundacao para a Ciencia e Tecnologia, FCT', Portugal [PTDC/SAU-GMG/69739/2006]
  4. Fundação para a Ciência e a Tecnologia [PTDC/SAU-GMG/69739/2006] Funding Source: FCT

Ask authors/readers for more resources

Mining massive amounts of transcript data for alternative splicing information is paramount to help understand how the maturation of RNA regulates gene expression. We developed an algorithm to cluster transcript data to annotated genes to detect unannotated splice variants. A higher number of alternatively spliced genes and isoforms were found compared to other alternative splicing databases. Comparison of human and mouse data revealed a marked increase, in human, of splice variants incorporating novel exons and retained introns. Previously unannotated exons were validated by tiling array expression data and shown to correspond preferentially to novel first exons. Retained introns were validated by tiling array and deep sequencing data. The majority of retained introns were shorter than 500 nt and had weak polypyrimidine tracts. A subset of retained introns matching small RNAs and displaying a high GC content suggests a possible coordination between splicing regulation and production of noncoding RNAs. Conservation of unannotated exons and retained introns was higher in horse, dog and cow than in rodents, and 64% of exon sequences were only found in primates. This analysis highlights previously bypassed alternative splice variants, which may be crucial to deciphering more complex pathways of gene regulation in human.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available