☆ 4.6 Article

CBA: Cluster-Guided Batch Alignment for Single Cell RNA-seq

FRONTIERS IN GENETICS (2021)

Journal

FRONTIERS IN GENETICS

Volume 12, Issue -, Pages -

Publisher

FRONTIERS MEDIA SA

DOI: 10.3389/fgene.2021.644211

Keywords

batch correction; auto-encoder; single-cell RNA sequencing; clustering; data integration

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

The power of single-cell RNA sequencing in detecting cell heterogeneity or developmental process is increasing, and combining two batches of scRNA-seq data requires solving technical differences, which can be further constrained by matching cells and cell types. In this study, an auto-encoder was utilized to achieve this goal, and the performance was evaluated against other alignment methods by preserving cluster separation and identifying biologically meaningful differential gene expressions.

The power of single-cell RNA sequencing (scRNA-seq) in detecting cell heterogeneity or developmental process is becoming more and more evident every day. The granularity of this knowledge is further propelled when combining two batches of scRNA-seq into a single large dataset. This strategy is however hampered by technical differences between these batches. Typically, these batch effects are resolved by matching similar cells across the different batches. Current approaches, however, do not take into account that we can constrain this matching further as cells can also be matched on their cell type identity. We use an auto-encoder to embed two batches in the same space such that cells are matched. To accomplish this, we use a loss function that preserves: (1) cell-cell distances within each of the two batches, as well as (2) cell-cell distances between two batches when the cells are of the same cell-type. The cell-type guidance is unsupervised, i.e., a cell-type is defined as a cluster in the original batch. We evaluated the performance of our cluster-guided batch alignment (CBA) using pancreas and mouse cell atlas datasets, against six state-of-the-art single cell alignment methods: Seurat v3, BBKNN, Scanorama, Harmony, LIGER, and BERMUDA. Compared to other approaches, CBA preserves the cluster separation in the original datasets while still being able to align the two datasets. We confirm that this separation is biologically meaningful by identifying relevant differential expression of genes for these preserved clusters.

CBA: Cluster-Guided Batch Alignment for Single Cell RNA-seq

Journal

FRONTIERS IN GENETICS

Publisher

FRONTIERS MEDIA SA

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

CBA: Cluster-Guided Batch Alignment for Single Cell RNA-seq

Journal

FRONTIERS IN GENETICS

Publisher

FRONTIERS MEDIA SA

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper