4.5 Article

BUTTERFLY: addressing the pooled amplification paradox with unique molecular identifiers in single-cell RNA-seq

Journal

GENOME BIOLOGY
Volume 22, Issue 1, Pages -

Publisher

BMC
DOI: 10.1186/s13059-021-02386-z

Keywords

Single-cell RNA-Seq; UMI; Droplet-based; PCR; Bias; Amplification; Batch correction; Correction

Funding

  1. Knut and Alice Wallenberg foundation
  2. National Cancer Institute of the National Institutes of Health [F32CA220848]
  3. NIH [U19MH114830]

Ask authors/readers for more resources

This study introduces UMIs in single-cell RNA-seq assays to identify duplicated molecules, presenting an improved quantification method called BUTTERFLY based on unseen species modeling. This method effectively addresses the bias caused by the naive removal of duplicates by demonstrating its efficacy across cell types and genes, sometimes even inverting the relative abundance of genes.
The incorporation of unique molecular identifiers (UMIs) in single-cell RNA-seq assays makes possible the identification of duplicated molecules, thereby facilitating the counting of distinct molecules from sequenced reads. However, we show that the naive removal of duplicates can lead to a bias due to a pooled amplification paradox, and we propose an improved quantification method based on unseen species modeling. Our correction called BUTTERFLY uses a zero truncated negative binomial estimator implemented in the kallisto bustools workflow. We demonstrate its efficacy across cell types and genes and show that in some cases it can invert the relative abundance of genes.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available