4.7 Article

Decomposing mosaic tandem repeats accurately from long reads

向作者/读者索取更多资源

Over the past 30 years, extended tandem repeats (TRs) have been associated with numerous diseases, mostly consisting of single repeat units. However, recent advancements in sequencing techniques have revealed the presence of mosaic TRs composed of different units in brain disorders. To address this issue, we propose an efficient algorithm called uTR that outperforms existing tools in terms of prediction accuracy and speed.
Motivation: Over the past 30 years, extended tandem repeats (TRs) have been correlated with similar to 60 diseases with high odds ratios, and most known TRs consist of single repeat units. However, in the last few years, mosaic TRs composed of different units have been found to be associated with several brain disorders by long-read sequencing techniques. Mosaic TRs are difficult-to-characterize sequence configurations that are usually confirmed by manual inspection. Widely used tools are not designed to solve the mosaic TR problem and often fail to properly decompose mosaic TRs. Results: We propose an efficient algorithm that can decompose mosaic TRs in the input string with high sensitivity. Using synthetic benchmark data, we demonstrate that our program named uTR outperforms TRF and RepeatMasker in terms of prediction accuracy, this is especially true when mosaic TRs are more complex, and uTR is faster than TRF and RepeatMasker in most cases.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据