4.7 Article

SMILE: mutual information learning for integration of single-cell omics data

Journal

BIOINFORMATICS
Volume 38, Issue 2, Pages 476-486

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btab706

Keywords

-

Funding

  1. National Institute of General Medical Sciences of the National Institutes of Health [R35GM133557]

Ask authors/readers for more resources

SMILE is an unsupervised deep learning algorithm that successfully integrates multi-source single-cell omics data and removes batch effects. By maximizing mutual information, it can project cell types from different tissues into a shared space and integrate data from different modalities.
Motivation: Deep learning approaches have empowered single-cell omics data analysis in many ways and generated new insights from complex cellular systems. As there is an increasing need for single-cell omics data to be integrated across sources, types and features of data, the challenges of integrating single-cell omics data are rising. Here, we present an unsupervised deep learning algorithm that learns discriminative representations for single-cell data via maximizing mutual information, SMILE (Single-cell Mutual Information Learning). Results: Using a unique cell-pairing design, SMILE successfully integrates multisource single-cell transcriptome data, removing batch effects and projecting similar cell types, even from different tissues, into the shared space. SMILE can also integrate data from two or more modalities, such as joint-profiling technologies using single-cell ATAC-seq, RNA-seq, DNA methylation, Hi-C and ChIP data. When paired cells are known, SMILE can integrate data with unmatched feature, such as genes for RNA-seq and genome-wide peaks for ATAC-seq. Integrated representations learned from joint-profiling technologies can then be used as a framework for comparing independent single source data.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available