4.8 Article

Clustering of single-cell multi-omics data with a multimodal deep learning method

Journal

NATURE COMMUNICATIONS
Volume 13, Issue 1, Pages -

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41467-022-35031-9

Keywords

-

Funding

  1. National Institutes of Health (NIH) [R15HG012087]
  2. National Center for Advancing Translational Sciences (NCATS)
  3. NIH [UL1TR003017]
  4. National Science Foundation [ACI-1548562]

Ask authors/readers for more resources

Single-cell multimodal sequencing technologies provide an opportunity to analyze different types of data in the same cell simultaneously. However, combining multiple data sources for clustering analysis of single-cell multimodal data remains a challenge. In this study, a novel deep learning method called scMDC is developed, which explicitly models different data sources and learns latent features for clustering analysis. The experimental results show that scMDC outperforms existing methods on single-cell multimodal datasets and has linear scalability for analyzing large datasets.
Single-cell multimodal sequencing technologies are developed to simultaneously profile different modalities of data in the same cell. It provides a unique opportunity to jointly analyze multimodal data at the single-cell level for the identification of distinct cell types. A correct clustering result is essential for the downstream complex biological functional studies. However, combining different data sources for clustering analysis of single-cell multimodal data remains a statistical and computational challenge. Here, we develop a novel multimodal deep learning method, scMDC, for single-cell multi-omics data clustering analysis. scMDC is an end-to-end deep model that explicitly characterizes different data sources and jointly learns latent features of deep embedding for clustering analysis. Extensive simulation and real-data experiments reveal that scMDC outperforms existing single-cell single-modal and multimodal clustering methods on different single-cell multimodal datasets. The linear scalability of running time makes scMDC a promising method for analyzing large multimodal datasets.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available