4.6 Article

Probabilistic harmonization and annotation of single-cell transcriptomics data with deep generative models

Journal

MOLECULAR SYSTEMS BIOLOGY
Volume 17, Issue 1, Pages -

Publisher

WILEY
DOI: 10.15252/msb.20209620

Keywords

scRNA-seq; harmonization; annotation; differential expression; variational inference

Funding

  1. NIH-NIAID [U19 AI090023]
  2. NIMH [U19 MH114821]

Ask authors/readers for more resources

As the number of single-cell transcriptomics datasets increases, integrating data to achieve a common ontology of cell types and states becomes the natural next step. scVI and scANVI are two methods that can effectively integrate data and perform well in cell state annotation, with high accuracy, scalability, and adaptability.
As the number of single-cell transcriptomics datasets grows, the natural next step is to integrate the accumulating data to achieve a common ontology of cell types and states. However, it is not straightforward to compare gene expression levels across datasets and to automatically assign cell type labels in a new dataset based on existing annotations. In this manuscript, we demonstrate that our previously developed method, scVI, provides an effective and fully probabilistic approach for joint representation and analysis of scRNA-seq data, while accounting for uncertainty caused by biological and measurement noise. We also introduce single-cell ANnotation using Variational Inference (scANVI), a semi-supervised variant of scVI designed to leverage existing cell state annotations. We demonstrate that scVI and scANVI compare favorably to state-of-the-art methods for data integration and cell state annotation in terms of accuracy, scalability, and adaptability to challenging settings. In contrast to existing methods, scVI and scANVI integrate multiple datasets with a single generative model that can be directly used for downstream tasks, such as differential expression. Both methods are easily accessible through scvi-tools.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available