4.8 Article

Integrated analysis of multimodal single-cell data with structural similarity

Journal

NUCLEIC ACIDS RESEARCH
Volume 50, Issue 21, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/nar/gkac781

Keywords

-

Funding

  1. National Science Foundation [IIS-1715017, DMS-1763272]
  2. National Institutes of Health [U54-CA217378, R01HG012572]
  3. National Institute of Mental Health [MH123896]
  4. Simons Foundation [594598]

Ask authors/readers for more resources

In this study, we propose a deep learning framework called SAILERX for efficient, robust, and flexible analysis of multi-modal single-cell data. SAILERX utilizes invariant representation learning and multimodal data alignment mechanism to handle noise, integrate information, and provide various downstream analysis functions.
Multimodal single-cell sequencing technologies provide unprecedented information on cellular heterogeneity from multiple layers of genomic readouts. However, joint analysis of two modalities without properly handling the noise often leads to overfitting of one modality by the other and worse clustering results than vanilla single-modality analysis. How to efficiently utilize the extra information from single cell multi-omics to delineate cell states and identify meaningful signal remains as a significant computational challenge. In this work, we propose a deep learning framework, named SAILERX, for efficient, robust, and flexible analysis of multi-modal single-cell data. SAILERX consists of a variational autoencoder with invariant representation learning to correct technical noises from sequencing process, and a multimodal data alignment mechanism to integrate information from different modalities. Instead of performing hard alignment by projecting both modalities to a shared latent space, SAILERX encourages the local structures of two modalities measured by pairwise similarities to be similar. This strategy is more robust against overfitting of noises, which facilitates various downstream analysis such as clustering, imputation, and marker gene detection. Furthermore, the invariant representation learning part enables SAILERX to perform integrative analysis on both multi- and single-modal datasets, making it an applicable and scalable tool for more general scenarios.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available