4.7 Article

A Bayesian method to cluster single-cell RNA sequencing data using copy number alterations

Journal

BIOINFORMATICS
Volume 38, Issue 9, Pages 2512-2518

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btac143

Keywords

-

Funding

  1. AIRC [MFAG 2020-ID, 24913]

Ask authors/readers for more resources

CONGAS is a Bayesian probabilistic method that can determine the composition of tumor subclones and differentiate RNA expression differences by jointly identifying clusters of single cells. It has important implications for cancer research.
Motivation: Cancers are composed by several heterogeneous subpopulations, each one harbouring different genetic and epigenetic somatic alterations that contribute to disease onset and therapy response. In recent years, copy number alterations (CNAs) leading to tumour aneuploidy have been identified as potential key drivers of such populations, but the definition of the precise makeup of cancer subclones from sequencing assays remains challenging. In the end, little is known about the mapping between complex CNAs and their effect on cancer phenotypes. Results: We introduce CONGAS, a Bayesian probabilistic method to phase bulk DNA and single-cell RNA measurements from independent assays. CONGAS jointly identifies clusters of single cells with subclonal CNAs, and differences in RNA expression. The model builds statistical priors leveraging bulk DNA sequencing data, does not require a normal reference and scales fast thanks to a GPU backend and variational inference. We test CONGAS on both simulated and real data, and find that it can determine the tumour subclonal composition at the single-cell level together with clone-specific RNA phenotypes in tumour data generated from both 10x and Smart-Seq assays.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available