4.7 Article

UMIErrorCorrect and UMIAnalyzer: Software for Consensus Read Generation, Error Correction, and Visualization Using Unique Molecular Identifiers

期刊

CLINICAL CHEMISTRY
卷 68, 期 11, 页码 1425-1435

出版社

OXFORD UNIV PRESS INC
DOI: 10.1093/clinchem/hvac136

关键词

-

资金

  1. Assar Gabrielssons Research Foundation
  2. Johan Jansson Foundation for Cancer Research
  3. Sahlgrenska University Hospital
  4. Assar Gabrielssons Research, Foundation
  5. Lion's Cancer Research Fund of Western Sweden
  6. Wilhelm and Martina Lundgren Foundation
  7. University of Gothenburg
  8. Anna-Lisa och Bror Bjornsson stiftelse
  9. AstraZeneca
  10. Simsen Diagnostics
  11. Region Vastra Gotaland, Sweden
  12. Swedish Research Council [2020-01008]
  13. Swedish government [965065]
  14. Sweden's Innovation Agency [2018-00421, 2020-04141]
  15. Swedish Cancer Society [19-0306]
  16. Swedish Childhood Cancer Foundation [2020-007, MTI2019-0008]
  17. Sjoberg Foundation
  18. Region Vastra Gotaland, Sahlgrenska University Hospital, University of Gothenburg
  19. Vinnova [2020-01008, 2018-00421] Funding Source: Vinnova
  20. Formas [2018-00421] Funding Source: Formas

向作者/读者索取更多资源

The study developed a new bioinformatics pipeline, UMIErrorCorrect, for analyzing sequencing data containing UMIs, and provided a user interface called UMIAnalyzer. Analysis of data demonstrated that UMIErrorCorrect showed higher sensitivity in variant detection for targeted UMI sequencing data.
Background Targeted sequencing using unique molecular identifiers (UMIs) enables detection of rare variant alleles in challenging applications, such as cell-free DNA analysis from liquid biopsies. Standard bioinformatics pipelines for data processing and variant calling are not adapted for deep-sequencing data containing UMIs, are inflexible, and require multistep workflows or dedicated computing resources. Methods We developed a bioinformatics pipeline using Python and an R package for data analysis and visualization. To validate our pipeline, we analyzed cell-free DNA reference material with known mutant allele frequencies (0%, 0.125%, 0.25%, and 1%) and public data sets. Results We developed UMIErrorCorrect, a bioinformatics pipeline for analyzing sequencing data containing UMIs. UMIErrorCorrect only requires fastq files as inputs and performs alignment, UMI clustering, error correction, and variant calling. We also provide UMIAnalyzer, a graphical user interface, for data mining, visualization, variant interpretation, and report generation. UMIAnalyzer allows the user to adjust analysis parameters and study their effect on variant calling. We demonstrated the flexibility of UMIErrorCorrect by analyzing data from 4 different targeted sequencing protocols. We also show its ability to detect different mutant allele frequencies in standardized cell-free DNA reference material. UMIErrorCorrect outperformed existing pipelines for targeted UMI sequencing data in terms of variant detection sensitivity. Conclusions UMIErrorCorrect and UMIAnalyzer are comprehensive and customizable bioinformatics tools that can be applied to any type of library preparation protocol and enrichment chemistry using UMIs. Access to simple, generic, and open-source bioinformatics tools will facilitate the implementation of UMI-based sequencing approaches in basic research and clinical applications.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据