期刊
RNA
卷 25, 期 2, 页码 205-218出版社
COLD SPRING HARBOR LAB PRESS, PUBLICATIONS DEPT
DOI: 10.1261/rna.069112.118
关键词
N-6-methyladenosine; machine learning; deep learning; RNA word embedding; mRNA
资金
- National Key R&D Program of China [SQ2018YFC090002]
- Natural Science Foundation of China [61771331, 61822306, 61672184]
- Edanz Group China
N-6-Methyladenosine (m(6)A) refers to methylation modification of the adenosine nucleotide acid at the nitrogen-6 position. Many conventional computational methods for identifying N-6-methyladenosine sites are limited by the small amount of data available. Taking advantage of the thousands of m(6)A sites detected by high-throughput sequencing, it is now possible to discover the characteristics of m(6)A sequences using deep learning techniques. To the best of our knowledge, our work is the first attempt to use word embedding and deep neural networks for m(6)A prediction from mRNA sequences. Using four deep neural networks, we developed a model inferred from a larger sequence shifting window that can predict m(6)A accurately and robustly. Four prediction schemes were built with various RNA sequence representations and optimized convolutional neural networks. The soft voting results from the four deep networks were shown to outperform all of the state-of-the-art methods. We evaluated these predictors mentioned above on a rigorous independent test data set and proved that our proposed method outperforms the state-of-the-art predictors. The training, independent, and cross-species testing data sets are much larger than in previous studies, which could help to avoid the problem of overfitting.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据