4.8 Article

Model-based deep embedding for constrained clustering analysis of single cell RNA-seq data

期刊

NATURE COMMUNICATIONS
卷 12, 期 1, 页码 -

出版社

NATURE PORTFOLIO
DOI: 10.1038/s41467-021-22008-3

关键词

-

资金

  1. Extreme Science and Engineering Discovery Environment (XSEDE) through allocation [CIE160021, CIE170034]
  2. National Science Foundation [ACI-1548562]

向作者/读者索取更多资源

Clustering is a critical step in single cell-based studies, but purely unsupervised clustering may not produce biologically interpretable results in high dimensional and sparse data settings. Integrating domain knowledge into the clustering step is necessary to improve the interpretability and accuracy of cell type identification.
Clustering is a critical step in single cell-based studies. Most existing methods support unsupervised clustering without the a priori exploitation of any domain knowledge. When confronted by the high dimensionality and pervasive dropout events of scRNA-Seq data, purely unsupervised clustering methods may not produce biologically interpretable clusters, which complicates cell type assignment. In such cases, the only recourse is for the user to manually and repeatedly tweak clustering parameters until acceptable clusters are found. Consequently, the path to obtaining biologically meaningful clusters can be ad hoc and laborious. Here we report a principled clustering method named scDCC, that integrates domain knowledge into the clustering step. Experiments on various scRNA-seq datasets from thousands to tens of thousands of cells show that scDCC can significantly improve clustering performance, facilitating the interpretability of clusters and downstream analyses, such as cell type assignment. Clustering cells based on similarities in gene expression is the first step towards identifying cell types in scRNASeq data. Here the authors incorporate biological knowledge into the clustering step to facilitate the biological interpretability of clusters, and subsequent cell type identification.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据