4.8 Article

Identifying phenotype-associated subpopulations by integrating bulk and single-cell sequencing data

Journal

NATURE BIOTECHNOLOGY
Volume 40, Issue 4, Pages 527-+

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41587-021-01091-3

Keywords

-

Funding

  1. NIH [5K01LM012877, 1R21HL145426, 1R01CA207377]
  2. NIH NIGMS [MIRA R35GM124704]
  3. Medical Research Foundation of Oregon
  4. NCI [R01 CA251245, P50 CA097186, P50 CA186786, P50 CA186786-07S1, R01CA244576]
  5. Department of Defense [W81XWH-16-1-0597]

Ask authors/readers for more resources

Scissor is a method that identifies cell subpopulations associated with a given phenotype from single-cell data, by quantifying the similarity between single cells and bulk samples, and optimizing a regression model with sample phenotype. Applied in lung cancer, melanoma, facioscapulohumeral muscular dystrophy, and Alzheimer's disease datasets, Scissor effectively identifies biologically and clinically relevant cell subpopulations.
Bulk and single cell measurements are integrated to identify phenotype-associated subpopulations of cells. Single-cell RNA sequencing (scRNA-seq) distinguishes cell types, states and lineages within the context of heterogeneous tissues. However, current single-cell data cannot directly link cell clusters with specific phenotypes. Here we present Scissor, a method that identifies cell subpopulations from single-cell data that are associated with a given phenotype. Scissor integrates phenotype-associated bulk expression data and single-cell data by first quantifying the similarity between each single cell and each bulk sample. It then optimizes a regression model on the correlation matrix with the sample phenotype to identify relevant subpopulations. Applied to a lung cancer scRNA-seq dataset, Scissor identified subsets of cells associated with worse survival and with TP53 mutations. In melanoma, Scissor discerned a T cell subpopulation with low PDCD1/CTLA4 and high TCF7 expression associated with an immunotherapy response. Beyond cancer, Scissor was effective in interpreting facioscapulohumeral muscular dystrophy and Alzheimer's disease datasets. Scissor identifies biologically and clinically relevant cell subpopulations from single-cell assays by leveraging phenotype and bulk-omics datasets.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available