4.7 Review

Matrix factorization for biomedical link prediction and scRNA-seq data imputation: an empirical survey

Journal

BRIEFINGS IN BIOINFORMATICS
Volume 23, Issue 1, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbab479

Keywords

matrix factorization; biomedical network; link prediction; scRNA-seq data; data imputation

Funding

  1. National Natural Science Foundation of China [62173235, 61602309, 11871026, 11871237]
  2. Guangdong Basic and Applied Basic Research Foundation [2019A1515011384]
  3. Shenzhen Fundamental Research Program [JCYJ20170817095210760]

Ask authors/readers for more resources

This paper presents a comprehensive review on the usage of matrix factorization methods in biomedical link prediction and scRNA-seq data imputation. By conducting a systematic empirical comparison on real data sets, the authors provide general guidelines for selecting matrix factorization methods for different biomedical matrix completion tasks and point out some future directions for improving the performance in biomedical link prediction and scRNA-seq data imputation.
Advances in high-throughput experimental technologies promote the accumulation of vast number of biomedical data. Biomedical link prediction and single-cell RNA-sequencing (scRNA-seq) data imputation are two essential tasks in biomedical data analyses, which can facilitate various downstream studies and gain insights into the mechanisms of complex diseases. Both tasks can be transformed into matrix completion problems. For a variety of matrix completion tasks, matrix factorization has shown promising performance. However, the sparseness and high dimensionality of biomedical networks and scRNA-seq data have raised new challenges. To resolve these issues, various matrix factorization methods have emerged recently. In this paper, we present a comprehensive review on such matrix factorization methods and their usage in biomedical link prediction and scRNA-seq data imputation. Moreover, we select representative matrix factorization methods and conduct a systematic empirical comparison on 15 real data sets to evaluate their performance under different scenarios. By summarizing the experimental results, we provide general guidelines for selecting matrix factorization methods for different biomedical matrix completion tasks and point out some future directions to further improve the performance for biomedical link prediction and scRNA-seq data imputation.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available