4.5 Article

Joint Clustering of Single-Cell Sequencing and Fluorescence In Situ Hybridization Data for Reconstructing Clonal Heterogeneity in Cancers

Journal

JOURNAL OF COMPUTATIONAL BIOLOGY
Volume 28, Issue 11, Pages 1035-1051

Publisher

MARY ANN LIEBERT, INC
DOI: 10.1089/cmb.2021.0255

Keywords

cancer; clonal evolution; sequencing; single-cell genomics

Funding

  1. Intramural Research Program of the National Institutes of Health, National Library of Medicine
  2. Center for Cancer Research
  3. Division of Cancer Epidemiology and Genetics within the National Cancer Institute
  4. Exploration Program of the Shenzhen Science and Technology Innovation Committee [JCYJ20170303151334808]
  5. US NIH [R21CA216452]
  6. Pennsylvania Dept. of Health [4100070287, FP00003273]
  7. National Human Genome Research Institute of the National Institutes of Health [R01HG010589]

Ask authors/readers for more resources

The study developed a joint clustering method combining single-cell DNA sequencing and multiplex interphase fluorescence in situ hybridization data to better understand ploidy changes in tumor evolution.
Aneuploidy and whole genome duplication (WGD) events are common features of cancers associated with poor outcomes, but the ways they influence trajectories of clonal evolution are poorly understood. Phylogenetic methods for reconstructing clonal evolution from genomic data have proven a powerful tool for understanding how clonal evolution occurs in the process of cancer progression, but extant methods so far have limited the ability to resolve tumor evolution via ploidy changes. This limitation exists in part because single-cell DNA-sequencing (scSeq), which has been crucial to developing detailed profiles of clonal evolution, has difficulty in resolving ploidy changes and WGD. Multiplex interphase fluorescence in situ hybridization (miFISH) provides a more unambiguous signal of single-cell ploidy changes but it is limited to profiling small numbers of single markers. Here, we develop a joint clustering method to combine these two data sources with the goal of better resolving ploidy changes in tumor evolution. We develop a probabilistic framework to maximize the probability of latent variables given the pre-clustered datasets, which we optimize via Markov chain Monte Carlo sampling combined with linear regression. We validate the method by using simulated data derived from a glioblastoma (GBM) case profiled by both scSeq and miFISH. We further apply the method to two GBM cases with scSeq and miFISH data by reconstructing a phylogenetic tree from the joint clustering results, demonstrating their synergistic value in understanding how focal copy number changes and WGD events can collectively contribute to tumor progression.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available