4.6 Article

SRG-Vote: Predicting Mirna-Gene Relationships via Embedding and LSTM Ensemble

期刊

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JBHI.2022.3169542

关键词

Feature extraction; Deep learning; Predictive models; Biological system modeling; Urban areas; Bioinformatics; Data mining; Ensemble; LSTM; miRNA-gene relation- ships

资金

  1. National Natural Science Foundation of China [32170654, 32000464]
  2. Shenzhen Research Institute, City University of Hong Kong
  3. Health and Medical Research Fund
  4. Food and Health Bureau, The Government of the Hong Kong Special Administrative Region [07181426]
  5. City University of Hong Kong [CityU 11202219, CityU 11203520, CityU 11203221]

向作者/读者索取更多资源

This paper proposes a model that combines feature extraction methods, deep learning algorithms, and a voting system to study the relationship between miRNAs and genes. By using high-throughput technology to process large amounts of biological data, the model is able to reveal potential associations between miRNAs and genes in cancer therapy.
Targeted therapy for one for a set of genes has made it possible to apply precision medicine for different patients due to the existence of tumor heterogeneity. However, how to regulate those genes are still problematic. One of the natural regulators of genes is microRNAs. Thus, a better understanding of the miRNA-gene interaction mechanism might contribute to future diagnosis, prevention, and cancer therapy. The interactions between microRNA and genes play an essential role in molecular genetics. The in-vivo experiments validating the relationships between them are time-consuming, money-costly, and labor-intensive. With the development of high-throughput technology, we dealt with tons of biological data. However, extracting features from tremendous raw data and making a mathematical model is still a challenging topic. Machine learning and deep learning algorithms have become powerful tools in dealing with biological data. Inspired by this, in this paper, we propose a model that combines features/embedding extraction methods, deep learning algorithms, and a voting system. We leverage doc2vec to generate sequential embedding from molecular sequences. The role2vec, GCN, and GMM for geometrical embedding were generated from the complex network from similarity and pair-wise datasets. For the deep learning algorithms, we leveraged LSTM and Bi-LSTM according to different embedding and features. Finally, we adopted a voting system to balance results from different data sources. The results have shown that our voting system could achieve a higher AUC than the existing benchmark. The case studies demonstrate that our model could reveal potential relationships between miRNAs and genes. The source code, features, and predictive results can be downloaded at https://github.com/Xshelton/SRG-vote.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据