4.5 Article

DNRLCNN: A CNN Framework for Identifying MiRNA-Disease Associations Using Latent Feature Matrix Extraction with Positive Samples

Journal

Publisher

SPRINGER HEIDELBERG
DOI: 10.1007/s12539-022-00509-z

Keywords

Similarity networks; Disease-gene association; Latent feature extraction; Positive samples; Convolutional neural network

Funding

  1. Natural Science Foundation of Hunan Province of China [2018JJ2262, 2020JJ5373]
  2. CSC [201906725017]
  3. Scientific Research Fund of Hunan Provincial Education Department [15CY007, 22A211, 20B348]
  4. Natural Science Foundation of Yunnan Province of China [2019FA024]
  5. National Natural Science Foundation of China [62002116, 61972185]

Ask authors/readers for more resources

miRNAs have strong relationships with human diseases, and studying these associations can provide insights into the activities of miRNAs and pathogenesis mechanisms, as well as new opportunities for disease diagnosis and drug discovery. This paper proposes a convolutional neural network framework called DNRLCNN, which uses a latent feature matrix extracted from positive samples to predict miRNA-disease associations. The model outperforms other state-of-the-art methods in predicting associations and has been validated on two human diseases.
Emerging evidence indicates that miRNAs have strong relationships with many human diseases. Investigating the associations will contribute to elucidating the activities of miRNAs and pathogenesis mechanisms, and providing new opportunities for disease diagnosis and drug discovery. Therefore, it is of significance to identify potential associations between miRNAs and diseases. The existing databases about the miRNA-disease associations (MDAs) only provide the known MDAs, which can be regarded as positive samples. However, the unknown MDAs are not sufficient to regard as reliable negative samples. To deal with this uncertainty, we proposed a convolutional neural network (CNN) framework, named DNRLCNN, based on a latent feature matrix extracted by only positive samples to predict MDAs. First, by only considering the positive samples into the calculation process, we captured the latent feature matrix for complex interactions between miRNAs and diseases in low-dimensional space. Then, we constructed a feature vector for each miRNA and disease pair based on the feature representation. Finally, we adopted a modified CNN for the feature vector to predict MDAs. As a result, our model achieves better performance than other state-of-the-art methods which based CNN in fivefold cross-validation on both miRNA-disease association prediction task (average AUC of 0.9030) and miRNA-phenotype association prediction task (average AUC of 0. 9442). In addition, we carried out case studies on two human diseases, and all the top-50 predicted miRNAs for lung neoplasms are confirmed by HMDD v3.2 and dbDEMC 2.0 databases, 98% of the top-50 predicted miRNAs for heart failure are confirmed. The experiment results show that our model has the capability of inferring potential disease-related miRNAs. [GRAPHICS] .

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available