4.7 Article

Genome Rearrangement Distance With a Flexible Intergenic Regions Aspect

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TCBB.2022.3165443

关键词

Genome rearrangements; intergenic regions; reversals; transpositions

向作者/读者索取更多资源

Most existing mathematical models for genome rearrangement problems consider only gene order, but recent research has started to incorporate more information such as intergenic region sizes. This study investigates a new variation where the target intergenic region sizes have a range of acceptable values, allowing flexibility in transforming gene order while considering variations in intergenic regions. The authors present approximation algorithms and an NP-hardness proof for the problem, relying on the Flexible Weighted Cycle Graph.
Most mathematical models for genome rearrangement problems have considered only gene order. In this way, the rearrangement distance considering some set of events, such as reversal and transposition events, is commonly defined as the minimum number of rearrangement events that transform the gene order from a genome G(1) into the gene order from a genome G(2). Recent works initiate incorporating more information such as the sizes of the intergenic regions (i.e., number of nucleotides between pairs of consecutive genes), which yields good results for estimated distances on real data. In these models, besides transforming the gene order, the sequence of rearrangement events must transform the list of intergenic regions sizes from G(1) into the list of intergenic regions sizes from G(2) (target list). We study a new variation where the target list is flexible, in the sense that each target intergenic region size is in a range of acceptable values. This allows us to model scenarios where the main objective is still to transform the order of genes from the source genome into the target genome, allowing flexibility in the sizes of the intergenic regions, since the nucleotides in these regions tend to undergo more changes when compared to genes. We investigate the rearrangement distance considering three sets of events, two with the exclusive use of reversals or transpositions, and the other allowing both rearrangement events. We present approximation algorithms for the problems and an NP-hardness proof. Our results rely on the Flexible Weighted Cycle Graph, adapted from the breakpoint graph to deal with flexible intergenic regions sizes.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据