4.4 Article

Gene2vec: gene subsequence embedding for prediction of mammalian N6-methyladenosine sites from mRNA

期刊

RNA
卷 25, 期 2, 页码 205-218

出版社

COLD SPRING HARBOR LAB PRESS, PUBLICATIONS DEPT
DOI: 10.1261/rna.069112.118

关键词

N-6-methyladenosine; machine learning; deep learning; RNA word embedding; mRNA

资金

  1. National Key R&D Program of China [SQ2018YFC090002]
  2. Natural Science Foundation of China [61771331, 61822306, 61672184]
  3. Edanz Group China

向作者/读者索取更多资源

N-6-Methyladenosine (m(6)A) refers to methylation modification of the adenosine nucleotide acid at the nitrogen-6 position. Many conventional computational methods for identifying N-6-methyladenosine sites are limited by the small amount of data available. Taking advantage of the thousands of m(6)A sites detected by high-throughput sequencing, it is now possible to discover the characteristics of m(6)A sequences using deep learning techniques. To the best of our knowledge, our work is the first attempt to use word embedding and deep neural networks for m(6)A prediction from mRNA sequences. Using four deep neural networks, we developed a model inferred from a larger sequence shifting window that can predict m(6)A accurately and robustly. Four prediction schemes were built with various RNA sequence representations and optimized convolutional neural networks. The soft voting results from the four deep networks were shown to outperform all of the state-of-the-art methods. We evaluated these predictors mentioned above on a rigorous independent test data set and proved that our proposed method outperforms the state-of-the-art predictors. The training, independent, and cross-species testing data sets are much larger than in previous studies, which could help to avoid the problem of overfitting.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据