期刊
JOURNAL OF CHEMICAL INFORMATION AND MODELING
卷 62, 期 23, 页码 6271-6286出版社
AMER CHEMICAL SOC
DOI: 10.1021/acs.jcim.2c01305
关键词
-
类别
资金
- National Natural Science Foundation of China
- Yunnan Foundation Research Projects
- Yunnan Provincial Major Science and Technology Special Plan Projects
- [61603159]
- [62162033]
- [202201AT070154]
- [202101BE070001-056]
- [202002AD080001]
- [202103AA080015]
The progress in single-cell RNA sequencing (ScRNA-seq) technology allows for the accurate discovery of cell heterogeneity and diversity. Clustering is a crucial step in ScRNA-seq data analysis, but it faces challenges due to the high dimensionality and noise of the data. To overcome these challenges, we propose a novel ScRNA-seq data clustering model, RGNMF-DS, which incorporates similarity and dissimilarity regularizers for matrix decomposition and utilizes a graph regularizer to uncover the local geometric structure in the data. Experimental results demonstrate that our proposed model outperforms other state-of-the-art methods in clustering ScRNA-seq datasets.
The notable progress in single-cell RNA sequencing (ScRNA-seq) technology is beneficial to accurately discover the heterogeneity and diversity of cells. Clustering is an extremely important step during the ScRNA-seq data analysis. However, it cannot achieve satisfactory performances by directly clustering ScRNA-seq data due to its high dimensionality and noise. To address these issues, we propose a novel ScRNA-seq data representation model, termed Robust Graph regularized Non -Negative Matrix Factorization with Dissimilarity and Similarity constraints (RGNMF-DS), for ScRNA-seq data clustering. To accurately characterize the structure information of the labeled samples and the unlabeled samples, respectively, the proposed RGNMF-DS model adopts a couple of complementary regularizers (i.e., similarity and dissimilar regularizers) to guide matrix decomposition. In addition, we construct a graph regularizer to discover the local geometric structure hidden in ScRNA-seq data. Moreover, we adopt the l2,1-norm to measure the reconstruction error and thereby effectively improve the robustness of the proposed RGNMF-DS model to the noises. Experimental results on several ScRNA-seq datasets have demonstrated that our proposed RGNMF-DS model outperforms other state-of-the-art competitors in clustering.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据