4.7 Article Proceedings Paper

Semi-deconvolution of bulk and single-cell RNA-seq data with application to metastatic progression in breast cancer

Journal

BIOINFORMATICS
Volume 38, Issue SUPPL 1, Pages 386-394

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btac262

Keywords

-

Funding

  1. National Institutes of Health [R21CA216452, R01HG010589]
  2. Pennsylvania Department of Health [4100070287]
  3. Susan G. Komen for the Cure
  4. Mario Lemieux Foundation
  5. Breast Cancer Alliance
  6. AWS Machine Learning Research Awards
  7. Center for Machine Learning and Health Fellowship

Ask authors/readers for more resources

This study developed a new method to interpret sample collections for which only bulk RNA-seq is available for some samples using reference scRNA-seq. By integrating this information in a Quadratic Programming framework, the method can recover more accurate cell types and corresponding cell type abundances in bulk samples.
Motivation: Identifying cell types and their abundances and how these evolve during tumor progression is critical to understanding the mechanisms of metastasis and identifying predictors of metastatic potential that can guide the development of new diagnostics or therapeutics. Single-cell RNA sequencing (scRNA-seq) has been especially promising in resolving heterogeneity of expression programs at the single-cell level, but is not always feasible, e.g. for large cohort studies or longitudinal analysis of archived samples. In such cases, clonal subpopulations may still be inferred via genomic deconvolution, but deconvolution methods have limited ability to resolve fine clonal structure and may require reference cell type profiles that are missing or imprecise. Prior methods can eliminate the need for reference profiles but show unstable performance when few bulk samples are available. Results: In this work, we develop a new method using reference scRNA-seq to interpret sample collections for which only bulk RNA-seq is available for some samples, e.g. clonally resolving archived primary tissues using scRNA-seq from metastases. By integrating such information in a Quadratic Programming framework, our method can recover more accurate cell types and corresponding cell type abundances in bulk samples. Application to a breast tumor bone metastases dataset confirms the power of scRNA-seq data to improve cell type inference and quantification in same-patient bulk samples.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available