4.7 Article

Sorting cancer karyotypes using double-cut-and-joins, duplications and deletions

Journal

BIOINFORMATICS
Volume 37, Issue 11, Pages 1489-1496

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/bty381

Keywords

-

Funding

  1. Bella Walter Memorial Fund of the Israel Cancer Association
  2. Blavatnik Family foundation
  3. Edmond J. Safra Center for Bioinformatics at Tel-Aviv University

Ask authors/readers for more resources

This study introduces a model for the evolution of genomes with multiple gene copies, considering various types of operations including double-cut-and-joins, duplications, and deletions. It covers a broad spectrum of structural and numerical changes seen in tumor samples, and presents a method to reconstruct the full sequence of events during cancer evolution. The research advances the field by providing a more realistic approach to genome rearrangement analysis.
Motivation: Problems of genome rearrangement are central in both evolution and cancer research. Most genome rearrangement models assume that the genome contains a single copy of each gene and the only changes in the genome are structural, i.e. reordering of segments. In contrast, tumor genomes also undergo numerical changes such as deletions and duplications, and thus the number of copies of genes varies. Dealing with unequal gene content is a very challenging task, addressed by few algorithms to date. More realistic models are needed to help trace genome evolution during tumorigenesis. Results: Here, we present a model for the evolution of genomes with multiple gene copies using the operation types double-cut-and-joins, duplications and deletions. The events supported by the model are reversals, translocations, tandem duplications, segmental deletions and chromosomal amplifications and deletions, covering most types of structural and numerical changes observed in tumor samples. Our goal is to find a series of operations of minimum length that transform one karyotype into the other. We show that the problem is NP-hard and give an integer linear programming formulation that solves the problem exactly under some mild assumptions. We test our method on simulated genomes and on ovarian cancer genomes. Our study advances the state of the art in two ways: It allows a broader set of operations than extant models, thus being more realistic and it is the first study attempting to re-construct the full sequence of structural and numerical events during cancer evolution.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available