4.7 Article

Characterizing RNA Pseudouridylation by Convolutional Neural Networks

Journal

GENOMICS PROTEOMICS & BIOINFORMATICS
Volume 19, Issue 5, Pages 815-833

Publisher

ELSEVIER
DOI: 10.1016/j.gpb.2019.11.015

Keywords

Pseudouridylation; Convolution neural network; Sequence motif; Translation; RNA stability

Funding

  1. National Natural Science Foundation of China [61472205, 81630103]
  2. US National Science Foundation [DBI-1262107, IIS-1646333]
  3. China's Youth 1000Talent Program
  4. Beijing Advanced Innovation Center for Structural Biology

Ask authors/readers for more resources

The study introduced a model named PULSE based on convolutional neural network for analyzing large-scale Psi site data and characterizing the sequence features of pseudouridylation. The validation tests showed that PULSE outperformed other prediction methods, providing novel insights into the functional roles of pseudouridylation and enabling further research on the transcriptome-wide landscape of Psi sites.
Pseudouridine (Psi) is the most prevalent post-transcriptional RNA modification and is widespread in small cellular RNAs and mRNAs. However, the functions, mechanisms, and precise distribution of Psi s (especially in mRNAs) still remain largely unclear. The landscape of Psi s across the transcriptome has not yet been fully delineated. Here, we present a highly effective model based on a convolutional neural network (CNN), called PseudoUridyLation Site Estimator (PULSE), to analyze large-scale profiling data of Psi sites and characterize the contextual sequence features of pseudouridylation. PULSE, consisting of two alternatively-stacked convolution and pooling layers followed by a fully-connected neural network, can automatically learn the hidden patterns of pseudouridylation from the local sequence information. Extensive validation tests demonstrated that PULSE can outperform other state-of-the-art prediction methods and achieve high prediction accuracy, thus enabling us to further characterize the transcriptome-wide landscape of Psi sites. We further showed that the prediction results derived from PULSE can provide novel insights into understanding the functional roles of pseudouridylation, such as the regulations of RNA secondary structure, codon usage, translation, and RNA stability, and the connection to single nucleotide variants. The source code and final model for PULSE are available at https://github.com/mlcb-thu/PULSE.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available