4.7 Article

Alignment of single-cell RNA-seq samples without overcorrection using kernel density matching

期刊

GENOME RESEARCH
卷 31, 期 4, 页码 698-712

出版社

COLD SPRING HARBOR LAB PRESS, PUBLICATIONS DEPT
DOI: 10.1101/gr.261115.120

关键词

-

资金

  1. U.S. National Institutes of Health [R01GM130738, R01GM126553]
  2. National Defense Basic Scientific Research Program of China [2015CB553704]
  3. Sloan Research Foundation Fellowship
  4. HCA Seed Network grant from the Chan Zuckerberg Initiative

向作者/读者索取更多资源

Dmatch is a method for aligning multiple scRNA-seq experiments, leveraging an external expression atlas and kernel density matching to integrate diverse datasets and extract biological insights. The method performs well in simulations and shows great potential when applied to clinical samples, enabling cell type-specific differential gene expression comparisons and revealing shared cell populations.
Single-cell RNA sequencing (scRNA-seq) technology is poised to replace bulk cell RNA sequencing for many biological and medical applications as it allows users to measure gene expression levels in a cell type?specific manner. However, data produced by scRNA-seq often exhibit batch effects that can be specific to a cell type, to a sample, or to an experiment, which prevent integration or comparisons across multiple experiments. Here, we present Dmatch, a method that leverages an external expression atlas of human primary cells and kernel density matching to align multiple scRNA-seq experiments for downstream biological analysis. Dmatch facilitates alignment of scRNA-seq data sets with cell types that may overlap only partially and thus allows integration of multiple distinct scRNA-seq experiments to extract biological insights. In simulation, Dmatch compares favorably to other alignment methods, both in terms of reducing sample-specific clustering and in terms of avoiding overcorrection. When applied to scRNA-seq data collected from clinical samples in a healthy individual and five autoimmune disease patients, Dmatch enabled cell type?specific differential gene expression comparisons across biopsy sites and disease conditions and uncovered a shared population of pro-inflammatory monocytes across biopsy sites in RA patients. We further show that Dmatch increases the number of eQTLs mapped from population scRNA-seq data. Dmatch is fast, scalable, and improves the utility of scRNA-seq for several important applications. Dmatch is freely available online.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据