4.7 Article

Characterizing RNA Pseudouridylation by Convolutional Neural Networks

期刊

GENOMICS PROTEOMICS & BIOINFORMATICS
卷 19, 期 5, 页码 815-833

出版社

ELSEVIER
DOI: 10.1016/j.gpb.2019.11.015

关键词

Pseudouridylation; Convolution neural network; Sequence motif; Translation; RNA stability

资金

  1. National Natural Science Foundation of China [61472205, 81630103]
  2. US National Science Foundation [DBI-1262107, IIS-1646333]
  3. China's Youth 1000Talent Program
  4. Beijing Advanced Innovation Center for Structural Biology

向作者/读者索取更多资源

The study introduced a model named PULSE based on convolutional neural network for analyzing large-scale Psi site data and characterizing the sequence features of pseudouridylation. The validation tests showed that PULSE outperformed other prediction methods, providing novel insights into the functional roles of pseudouridylation and enabling further research on the transcriptome-wide landscape of Psi sites.
Pseudouridine (Psi) is the most prevalent post-transcriptional RNA modification and is widespread in small cellular RNAs and mRNAs. However, the functions, mechanisms, and precise distribution of Psi s (especially in mRNAs) still remain largely unclear. The landscape of Psi s across the transcriptome has not yet been fully delineated. Here, we present a highly effective model based on a convolutional neural network (CNN), called PseudoUridyLation Site Estimator (PULSE), to analyze large-scale profiling data of Psi sites and characterize the contextual sequence features of pseudouridylation. PULSE, consisting of two alternatively-stacked convolution and pooling layers followed by a fully-connected neural network, can automatically learn the hidden patterns of pseudouridylation from the local sequence information. Extensive validation tests demonstrated that PULSE can outperform other state-of-the-art prediction methods and achieve high prediction accuracy, thus enabling us to further characterize the transcriptome-wide landscape of Psi sites. We further showed that the prediction results derived from PULSE can provide novel insights into understanding the functional roles of pseudouridylation, such as the regulations of RNA secondary structure, codon usage, translation, and RNA stability, and the connection to single nucleotide variants. The source code and final model for PULSE are available at https://github.com/mlcb-thu/PULSE.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据