4.8 Article

Mapping single-cell data to reference atlases by transfer learning

Journal

NATURE BIOTECHNOLOGY
Volume 40, Issue 1, Pages 121-+

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41587-021-01001-7

Keywords

-

Funding

  1. Joachim Herz Stiftung
  2. BMBF [01IS18036A, 01IS18036B]
  3. European Union [874656]
  4. Helmholtz Association's Initiative and Networking Fund [874656, ZT-I-PF-5-01, ZT-I-0007]

Ask authors/readers for more resources

scArches is a deep learning strategy for mapping query datasets on top of a reference, allowing efficient and decentralized reference construction while preserving biological state information and removing batch effects. It generalizes to multimodal reference mapping and can impute missing modalities.
Large single-cell atlases are now routinely generated to serve as references for analysis of smaller-scale studies. Yet learning from reference data is complicated by batch effects between datasets, limited availability of computational resources and sharing restrictions on raw data. Here we introduce a deep learning strategy for mapping query datasets on top of a reference called single-cell architectural surgery (scArches). scArches uses transfer learning and parameter optimization to enable efficient, decentralized, iterative reference building and contextualization of new datasets with existing references without sharing raw data. Using examples from mouse brain, pancreas, immune and whole-organism atlases, we show that scArches preserves biological state information while removing batch effects, despite using four orders of magnitude fewer parameters than de novo integration. scArches generalizes to multimodal reference mapping, allowing imputation of missing modalities. Finally, scArches retains coronavirus disease 2019 (COVID-19) disease variation when mapping to a healthy reference, enabling the discovery of disease-specific cell states. scArches will facilitate collaborative projects by enabling iterative construction, updating, sharing and efficient use of reference atlases. Single-cell data are readily integrated with cell atlases using scArches.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available