Journal
NATURE COMMUNICATIONS
Volume 13, Issue 1, Pages -Publisher
NATURE PORTFOLIO
DOI: 10.1038/s41467-022-29271-y
Keywords
-
Categories
Funding
- National Health and Medical Research Council of Australia [APP1136197, APP1186084]
- University of Sydney Research Training Program Scholarship and Merit Award Supplementary Scholarship
- Common Fund of the Office of the Director of the National Institutes of Health
- NCI
- NHGRI
- NHLBI
- NIDA
- NIMH
- NINDS
Ask authors/readers for more resources
This study demonstrates a method for predicting cryptic donor activation caused by genetic splicing variants, with an empirical method defined by analyzing a large amount of RNA-Seq data, revealing the important determinants of cryptic donor activation.
Genetic variants affecting the consensus splicing motifs can alter binding of spliceosomal components and induce mis-splicing. Here, the authors develop a method, showing that ranking the most common recurring mis-splicing events in public RNA-Seq data can predict the activation of cryptic-donors. Predicting which cryptic-donors may be activated by a splicing variant in patient DNA is notoriously difficult. Through analysis of 5145 cryptic-donors (versus 86,963 decoy-donors not used; any GT or GC), we define an empirical method predicting cryptic-donor activation with 87% sensitivity and 95% specificity. Strength (according to four algorithms) and proximity to the annotated-donor appear important determinants of cryptic-donor activation. However, other factors such as splicing regulatory elements, which are difficult to identify, play an important role and are likely responsible for current prediction inaccuracies. We find that the most frequently recurring natural mis-splicing events at each exon-intron junction, summarised over 40,233 RNA-sequencing samples (40K-RNA), predict with accuracy which cryptic-donor will be activated in rare disease. 40K-RNA provides an accurate, evidence-based method to predict variant-activated cryptic-donors in genetic disorders, assisting pathology consideration of possible consequences of a variant for the encoded protein and RNA diagnostic testing strategies.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available