4.7 Article

scAdapt: virtual adversarial domain adaptation network for single cell RNA-seq data classification across platforms and species

期刊

BRIEFINGS IN BIOINFORMATICS
卷 22, 期 6, 页码 -

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbab281

关键词

single cell classification; batch correction; batch effects; virtual adversarial training; single cell RNA-seq; spatial transcriptomic

资金

  1. National Key R&D Program of China [2020YFB020003]
  2. National Natural Science Foundation of China [61772566, 81801132]
  3. Guangdong Frontier & Key Tech Innovation Program [2018B010109006, 2019B020228001]
  4. Natural Science Foundation of Guangdong, China [2019A1515012207, 2016ZT06D211]
  5. Introducing Innovative and Entrepreneurial Teams [2016ZT06D211]

向作者/读者索取更多资源

The study developed a virtual adversarial domain adaptation network called scAdapt to transfer cell labels between datasets with batch effects. scAdapt utilized both the labeled source and unlabeled target data to train an enhanced classifier, aligning centroids to create a joint embedding. The scAdapt outperformed existing methods for classification in various datasets and showed superior capabilities in preserving cluster structure.
In single cell analyses, cell types are conventionally identified based on expressions of known marker genes, whose identifications are time-consuming and irreproducible. To solve this issue, many supervised approaches have been developed to identify cell types based on the rapid accumulation of public datasets. However, these approaches are sensitive to batch effects or biological variations since the data distributions are different in cross-platforms or species predictions. In this study, we developed scAdapt, a virtual adversarial domain adaptation network, to transfer cell labels between datasets with batch effects. scAdapt used both the labeled source and unlabeled target data to train an enhanced classifier and aligned the labeled source centroids and pseudo-labeled target centroids to generate a joint embedding. The scAdapt was demonstrated to outperform existing methods for classification in simulated, cross-platforms, cross-species, spatial transcriptomic and COVID-19 immune datasets. Further quantitative evaluations and visualizations for the aligned embeddings confirm the superiority in cell mixing and the ability to preserve discriminative cluster structure present in the original datasets.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据