4.7 Article

Determining optical mapping errors by simulations

期刊

BIOINFORMATICS
卷 37, 期 20, 页码 3391-3397

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btab259

关键词

-

资金

  1. SGS project, VSB-Technical University of Ostrava [SP2021/94]
  2. Ministry of Health of the Czech Republic [NU20-06-00269]
  3. Celgene Research Grant [CZ-102]

向作者/读者索取更多资源

This article proposes a novel approach to modeling errors in the data obtained from the Bionano Genomics Inc. Saphyr system with Direct Label and Stain (DLS) chemistry. The main result is a framework for studying errors in the data obtained from the Saphyr instrument with DLS chemistry, using a simulation that computes how major sources of errors affect the distribution of fragment lengths in optical maps. The experiments show that this approach can be used to study errors in the optical mapping data analysis.
Motivation: Optical mapping is a complementary technology to traditional DNA sequencing technologies, such as next-generation sequencing (NGS). It provides genome-wide, high-resolution restriction maps from single, stained molecules of DNA. It can be used to detect large and small structural variants, copy number variations and complex rearrangements. Optical mapping is affected by different kinds of errors in comparison with traditional DNA sequencing technologies. It is important to understand the source of these errors and how they affect the obtained data. This article proposes a novel approach to modeling errors in the data obtained from the Bionano Genomics Inc. Saphyr system with Direct Label and Stain (DLS) chemistry. Some studies have already addressed this issue for older instruments with nicking enzymes, but we are unaware of a study that addresses this new system. Results: The main result is a framework for studying errors in the data obtained from the Saphyr instrument with DLS chemistry. The framework's main component is a simulation that computes how major sources of errors for this instrument (a false site, a missing site and resolution errors) affect the distribution of fragment lengths in optical maps. The simulation is parametrized by variables describing these errors and we are using a differential evolution algorithm to evaluate parameters that best fit the data from the instrument. Results of the experiments manifest that this approach can be used to study errors in the optical mapping data analysis.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据