4.8 Article

Souporcell: robust clustering of single-cell RNA-seq data by genotype without reference genotypes

Journal

NATURE METHODS
Volume 17, Issue 6, Pages 615-+

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41592-020-0820-1

Keywords

-

Funding

  1. Wellcome Trust [206194/Z/17/Z, WT098503]
  2. MRC Career Development Award [G1100339]
  3. NIHR/Wellcome Trust Clinical Research Facility
  4. UK Medical Research Council [MR/L003120/1]
  5. British Heart Foundation [RG/13/13/30194, RG/18/13/33946]
  6. National Institute for Health Research (Cambridge Biomedical Research Centre at the Cambridge University Hospital's NHS Foundation Trust)
  7. MRC [G1100339, MR/L003120/1] Funding Source: UKRI

Ask authors/readers for more resources

Souporcell clusters single-cell RNA-seq data using genotype information without the use of a genotype reference. Methods to deconvolve single-cell RNA-sequencing (scRNA-seq) data are necessary for samples containing a mixture of genotypes, whether they are natural or experimentally combined. Multiplexing across donors is a popular experimental design that can avoid batch effects, reduce costs and improve doublet detection. By using variants detected in scRNA-seq reads, it is possible to assign cells to their donor of origin and identify cross-genotype doublets that may have highly similar transcriptional profiles, precluding detection by transcriptional profile. More subtle cross-genotype variant contamination can be used to estimate the amount of ambient RNA. Ambient RNA is caused by cell lysis before droplet partitioning and is an important confounder of scRNA-seq analysis. Here we develop souporcell, a method to cluster cells using the genetic variants detected within the scRNA-seq reads. We show that it achieves high accuracy on genotype clustering, doublet detection and ambient RNA estimation, as demonstrated across a range of challenging scenarios.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available