期刊
NEURAL NETWORKS
卷 129, 期 -, 页码 385-391出版社
PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.neunet.2020.05.027
关键词
CNN; Natural language processing; word2vec; 10-fold cross-validation
资金
- Brain Research Program of the National Research Foundation (NRF), Korea - Korean government (MSIT) [NRF-2017M3C7A1044816]
N-6-methyladenosine (m(6)A) is a well-studied and most common interior messenger RNA (mRNA) modification that plays an important function in cell development. N(6)A is found in all kingdoms of life and many other cellular processes such as RNA splicing, immune tolerance, regulatory functions, RNA processing, and cancer. Despite the crucial role of m(6)A in cells, it was targeted computationally, but unfortunately, the obtained results were unsatisfactory. It is imperative to develop an efficient computational model that can truly represent m(6)A sites. In this regard, an intelligent and highly discriminative computational model namely: m6A-word2vec is introduced for the discrimination of m(6)A sites. Here, a concept of natural language processing in the form of word2vec is used to represent the motif of the target class automatically. These motifs (numerical descriptors) are automatically targeted from the human genome without any clear definition. Further, the extracted feature space is then forwarded to the convolution neural network model as input for prediction. The developed computational model obtained 83.17%, 92.69%, and 90.50% accuracy for benchmark datasets S-1, S-2, and S-3, respectively, using a 10-fold cross-validation test. The predictive outcomes validate that the developed intelligent computational model showed better performance compared to existing computational models. It is thus greatly estimated that the introduced computational model m6A-word2vec may be a supportive and practical tool for elementary and pharmaceutical research such as in drug design along with academia. (C) 2020 Elsevier Ltd. All rights reserved.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据