4.7 Article

Alignment of single-cell RNA-seq samples without overcorrection using kernel density matching

Journal

GENOME RESEARCH
Volume 31, Issue 4, Pages 698-712

Publisher

COLD SPRING HARBOR LAB PRESS, PUBLICATIONS DEPT
DOI: 10.1101/gr.261115.120

Keywords

-

Funding

  1. U.S. National Institutes of Health [R01GM130738, R01GM126553]
  2. National Defense Basic Scientific Research Program of China [2015CB553704]
  3. Sloan Research Foundation Fellowship
  4. HCA Seed Network grant from the Chan Zuckerberg Initiative

Ask authors/readers for more resources

Dmatch is a method for aligning multiple scRNA-seq experiments, leveraging an external expression atlas and kernel density matching to integrate diverse datasets and extract biological insights. The method performs well in simulations and shows great potential when applied to clinical samples, enabling cell type-specific differential gene expression comparisons and revealing shared cell populations.
Single-cell RNA sequencing (scRNA-seq) technology is poised to replace bulk cell RNA sequencing for many biological and medical applications as it allows users to measure gene expression levels in a cell type?specific manner. However, data produced by scRNA-seq often exhibit batch effects that can be specific to a cell type, to a sample, or to an experiment, which prevent integration or comparisons across multiple experiments. Here, we present Dmatch, a method that leverages an external expression atlas of human primary cells and kernel density matching to align multiple scRNA-seq experiments for downstream biological analysis. Dmatch facilitates alignment of scRNA-seq data sets with cell types that may overlap only partially and thus allows integration of multiple distinct scRNA-seq experiments to extract biological insights. In simulation, Dmatch compares favorably to other alignment methods, both in terms of reducing sample-specific clustering and in terms of avoiding overcorrection. When applied to scRNA-seq data collected from clinical samples in a healthy individual and five autoimmune disease patients, Dmatch enabled cell type?specific differential gene expression comparisons across biopsy sites and disease conditions and uncovered a shared population of pro-inflammatory monocytes across biopsy sites in RA patients. We further show that Dmatch increases the number of eQTLs mapped from population scRNA-seq data. Dmatch is fast, scalable, and improves the utility of scRNA-seq for several important applications. Dmatch is freely available online.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available