Journal
GENOME BIOLOGY
Volume 22, Issue 1, Pages -Publisher
BMC
DOI: 10.1186/s13059-021-02386-z
Keywords
Single-cell RNA-Seq; UMI; Droplet-based; PCR; Bias; Amplification; Batch correction; Correction
Funding
- Knut and Alice Wallenberg foundation
- National Cancer Institute of the National Institutes of Health [F32CA220848]
- NIH [U19MH114830]
Ask authors/readers for more resources
This study introduces UMIs in single-cell RNA-seq assays to identify duplicated molecules, presenting an improved quantification method called BUTTERFLY based on unseen species modeling. This method effectively addresses the bias caused by the naive removal of duplicates by demonstrating its efficacy across cell types and genes, sometimes even inverting the relative abundance of genes.
The incorporation of unique molecular identifiers (UMIs) in single-cell RNA-seq assays makes possible the identification of duplicated molecules, thereby facilitating the counting of distinct molecules from sequenced reads. However, we show that the naive removal of duplicates can lead to a bias due to a pooled amplification paradox, and we propose an improved quantification method based on unseen species modeling. Our correction called BUTTERFLY uses a zero truncated negative binomial estimator implemented in the kallisto bustools workflow. We demonstrate its efficacy across cell types and genes and show that in some cases it can invert the relative abundance of genes.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available