4.8 Article

Online single-cell data integration through projecting heterogeneous datasets into a common cell-embedding space

Journal

NATURE COMMUNICATIONS
Volume 13, Issue 1, Pages -

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41467-022-33758-z

Keywords

-

Funding

  1. State Key Research Development Program of China [2019YFA0110002]
  2. National Natural Science Foundation of China [32125007, 91940306]
  3. Beijing Advanced Innovation Center for Structural Biology
  4. Tsinghua-Peking Joint Center for Life Sciences
  5. Tsinghua University Branch of China National Center for Protein Sciences
  6. King Abdullah University of Science and Technology (KAUST) Office of Research Administration (ORA) [FCC/1/1976-44-01, FCC/1/1976-45-01, URF/1/4352-01-01, URF/1/4663-01-01]

SCALEX is a deep-learning method for online integration of diverse single-cell data. It accurately aligns different modalities of single-cell data, retains true biological differences, and has superior performance in large-scale single-cell applications.
Computational tools for integrative analyses of diverse single-cell experiments face formidable new challenges, including dramatic increases in data scale, sample heterogeneity, and the need to informatively cross-reference new data with foundational datasets. Here, we present SCALEX, a deep-learning method that integrates single-cell data by projecting cells into a batch-invariant, common cell-embedding space in a truly online manner (i.e., without retraining the model). SCALEX substantially outperforms online iNMF and other state-of-the-art non-online integration methods on benchmark single-cell datasets of diverse modalities (e.g., single-cell RNA sequencing, scRNA-seq; single-cell assay for transposase-accessible chromatin using sequencing, scATAC-seq), especially for datasets with partial overlaps, accurately aligning similar cell populations while retaining true biological differences. We showcase SCALEX's advantages by constructing continuously expandable single-cell atlases for human, mouse, and COVID-19 patients, each assembled from diverse data sources and growing with every new dataset. The online data integration capacity and superior performance make SCALEX particularly appropriate for large-scale single-cell applications to build upon previous scientific insights.
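
To make the phrase "online" concrete, here is a minimal Python sketch of the general idea of projecting new batches through a fixed encoder without retraining. It is not SCALEX's implementation or API: the names `FrozenEncoder` and `Atlas`, the hand-picked weights, and the simple library-size normalisation are all assumptions made for illustration, and a toy linear map stands in for the trained deep-learning encoder.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple

Vector = List[float]


@dataclass(frozen=True)
class FrozenEncoder:
    """Hypothetical stand-in for a trained encoder; its weights never change."""
    weights: Tuple[Tuple[float, ...], ...]  # shape: (n_latent, n_genes)

    def project(self, cells: Sequence[Sequence[float]]) -> List[Vector]:
        """Map each cell (a row of gene counts) into the shared embedding space."""
        embeddings = []
        for cell in cells:
            total = sum(cell) or 1.0  # library-size normalisation (assumed step)
            scaled = [x / total for x in cell]
            embeddings.append(
                [sum(w * x for w, x in zip(row, scaled)) for row in self.weights]
            )
        return embeddings


class Atlas:
    """Growing collection of embeddings that all come from one fixed encoder."""

    def __init__(self, encoder: FrozenEncoder) -> None:
        self.encoder = encoder
        self.embeddings: List[Vector] = []
        self.batch_labels: List[str] = []

    def add_batch(self, name: str, cells: Sequence[Sequence[float]]) -> List[Vector]:
        """Project a new batch and append it; existing embeddings are untouched."""
        new = self.encoder.project(cells)
        self.embeddings.extend(new)
        self.batch_labels.extend([name] * len(new))
        return new


# Toy weights chosen by hand; a real encoder would be learned from data.
encoder = FrozenEncoder(weights=((1.0, 0.0, -1.0), (0.0, 1.0, 0.0)))
atlas = Atlas(encoder)

atlas.add_batch("reference", [[10, 0, 10], [0, 5, 5]])
snapshot = [list(e) for e in atlas.embeddings]  # copy before adding new data

# A later study sequenced ten times deeper: a crude stand-in for a batch effect.
atlas.add_batch("new_study", [[100, 0, 100], [3, 3, 0]])
```

In this toy setting, the first cell of `new_study` has the same relative expression profile as the first reference cell, so both land on the same point in the embedding space, while the reference embeddings captured in `snapshot` remain unchanged after the new batch is added.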
