Journal
APPLIED MATHEMATICAL MODELLING
Volume 90, Issue -, Pages 875-888Publisher
ELSEVIER SCIENCE INC
DOI: 10.1016/j.apm.2020.08.065
Keywords
Single cell RNA-sequencing; Kernel non-negative matrix factorization; Cellular heterogeneity
Funding
- National Natural Science Foundation of China [11901575, 91730301, 11675060]
Ask authors/readers for more resources
This study introduces a novel approach based on kernel non-negative matrix factorization to detect nonlinear gene-gene relationships and build a low-dimensional representation on the original data. Furthermore, an efficient method for determining the optimal cluster number is proposed to improve clustering accuracy.
The emergence of single-cell RNA-sequencing is ideally placed to unravel cellular het-erogeneity in biological systems, an extremely challenging problem in single cell RNA sequencing studies. However, most current computational approaches lack the sensitivity to reliably detect nonlinear gene-gene relationships masked by dropout events. We proposed a kernel non-negative matrix factorization framework for detecting nonlinear relationships among genes, where the new kernel is developed using kernel tricks on cellular differentiability correlation. The newly constructed kernel not only provides a description on the gene-gene relationship, but also helps to build a new low-dimensional representation on the original data. Besides, we developed an efficient method for determining the optimal cluster number within each data set with the usage of Diffusion Maps. The proposed algorithm is further compared with representative algorithms: SC3 and several other state-of-the-art clustering methods, on several benchmark or real scRNA-Seq datasets using internal criteria (clustering number accuracy) and external criteria (Adjusted rand index and Normalized mutual information) to show effectiveness of our method. (c) 2020 Elsevier Inc. All rights reserved.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available