4.6 Article

Detection and inference of interspersed duplicated insertions from paired-end reads

期刊

DIGITAL SIGNAL PROCESSING
卷 111, 期 -, 页码 -

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.dsp.2020.102959

关键词

Interspersed duplicated insertions; Next-generation sequencing; Paired-end reads; Insertion contents; Dynamic process

资金

  1. Natural Science Foundation of China [61571341]

向作者/读者索取更多资源

Interspersed duplicated insertion (idINS) is a common type of genomic insertion that plays a significant role in genomic instability and cancer genesis. The novel algorithm DIPins accurately detects and infers idINS contents from paired-end reads, even when the variation exceeds the insert size. DIPins shows advantages over existing methods and has potential for accurate characterization of idINSs in the human genome.
Interspersed duplicated insertion (idINS) is a common type of genomic insertion and plays an important role in genomic instability in cancer genesis. Nevertheless, the detection of such type of insertions is challenging, since the reads originated from idINS regions in the donor sample are most likely to be mapped perfectly to other regions in the reference. Most of the existing approaches adopt paired-end mapping to detect idINSs, but the characterization of idINSs larger than the mean insert size is still challenging due to the short sequencing reads. Therefore, there is still a need for practical algorithms to detect and infer idINSs regardless of their lengths. Here, we present a new algorithm, called DIPins, which can accurately detect and infer idINSs contents from paired-end reads. DIPins is capable of detecting breakpoint positions and inferring the contents of idINSs even when the length of variation exceeds the paired-end insert size. The major principle of DIPins is that it extracts multiple signatures from split reads and integrates them to determine idINS positions and adopts a dynamic process to construct idINS contents by iteratively generating unobserved split reads from the restricted area around the idINS breakpoint. We test the performance of DIPins on both simulation and real data. The results demonstrate its advantages over other methods and its potential application prospects in the accurate characterization of idINSs in human genome. (C) 2021 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据