Article

Generalizable characteristics of false-positive bacterial variant calls

Journal

MICROBIAL GENOMICS
Volume 7, Issue 8

Publisher

MICROBIOLOGY SOC
DOI: 10.1099/mgen.0.000615

Keywords

false positive; variant calling; best practice; benchmarking

Funding

  1. National Institute for Health Research Health Protection Research Unit (NIHR HPRU) in Healthcare Associated Infections and Antimicrobial Resistance at Oxford University
  2. Public Health England (PHE) [HPRU-2012-10041]
  3. NIHR Oxford Biomedical Centre
  4. Health Data Research UK
  5. NIHR Oxford Biomedical Research Centre
  6. National Institute for Health Research


Minimizing false positives is crucial in variant calling. Of the pipelines analysed, using Snippy was found to be the most straightforward way to achieve this. The study also finds that a disproportionate number of false calls lie near indels, irrespective of the variant-calling pipeline.
Minimizing false positives is a critical issue when variant calling as no method is without error. It is common practice to post-process a variant-call file (VCF) using hard filter criteria intended to discriminate true-positive (TP) from false-positive (FP) calls. These are applied on the simple principle that certain characteristics are disproportionately represented among the set of FP calls and that a user-chosen threshold can maximize the number detected. To provide guidance on this issue, this study empirically characterized all false SNP and indel calls made using real Illumina sequencing data from six disparate species and 166 variant-calling pipelines (the combination of 14 read aligners with up to 13 different variant callers, plus four 'all-in-one' pipelines). We did not seek to optimize filter thresholds but instead to draw attention to those filters of greatest efficacy and the pipelines to which they may most usefully be applied. In this respect, this study acts as a coda to our previous benchmarking evaluation of bacterial variant callers, and provides general recommendations for effective practice. The results suggest that, of the pipelines analysed in this study, the most straightforward way of minimizing false positives would simply be to use Snippy. We also find that a disproportionate number of false calls, irrespective of the variant-calling pipeline, are located in the vicinity of indels, and highlight this as an issue for future development.
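As a rough illustration of the hard-filtering idea described in the abstract, the sketch below applies user-chosen thresholds to a minimal VCF and also drops SNPs that fall close to an indel. It is not the study's method: the function names (`parse_vcf`, `hard_filter`), the thresholds (`min_qual`, `min_depth`, `indel_window`) and the example records are all hypothetical, and the study explicitly did not optimize any threshold values. The parser assumes single-ALT records with a `DP` key in the INFO column.

```python
from typing import Iterable, List, NamedTuple


class Call(NamedTuple):
    chrom: str
    pos: int
    ref: str
    alt: str
    qual: float
    depth: int


def parse_vcf(lines: Iterable[str]) -> List[Call]:
    """Parse minimal single-ALT VCF records, skipping header and blank lines."""
    calls = []
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue
        f = line.rstrip("\n").split("\t")
        # INFO is a ';'-separated list; only key=value entries are kept.
        info = dict(kv.split("=", 1) for kv in f[7].split(";") if "=" in kv)
        qual = float(f[5]) if f[5] != "." else 0.0  # missing QUAL treated as 0
        calls.append(Call(f[0], int(f[1]), f[3], f[4], qual, int(info.get("DP", 0))))
    return calls


def is_indel(call: Call) -> bool:
    """A call is an indel when REF and ALT differ in length."""
    return len(call.ref) != len(call.alt)


def hard_filter(calls: List[Call], min_qual: float = 30.0,
                min_depth: int = 10, indel_window: int = 10) -> List[Call]:
    """Keep calls passing QUAL/DP thresholds; drop SNPs near any indel.

    All thresholds are illustrative placeholders, not recommended values.
    """
    # Every called indel counts, including ones that fail the thresholds.
    indels = [c for c in calls if is_indel(c)]
    kept = []
    for c in calls:
        if c.qual < min_qual or c.depth < min_depth:
            continue
        near_indel = not is_indel(c) and any(
            i.chrom == c.chrom and abs(i.pos - c.pos) <= indel_window
            for i in indels
        )
        if not near_indel:
            kept.append(c)
    return kept


# Hypothetical records, invented purely for illustration.
records = [
    "##fileformat=VCFv4.2",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO",
    "contig1\t100\t.\tA\tG\t50\t.\tDP=40",   # passes every filter
    "contig1\t200\t.\tC\tT\t12\t.\tDP=40",   # fails min_qual
    "contig1\t300\t.\tG\tA\t60\t.\tDP=4",    # fails min_depth
    "contig1\t500\t.\tAT\tA\t70\t.\tDP=35",  # indel, passes thresholds
    "contig1\t505\t.\tC\tG\t65\t.\tDP=35",   # SNP 5 bp from the indel
]

if __name__ == "__main__":
    for call in hard_filter(parse_vcf(records)):
        print(call.chrom, call.pos, call.ref, call.alt)
```

With these placeholder settings only the calls at positions 100 and 500 survive; the SNP at 505 is removed solely because of its proximity to the indel. Counting all called indels, rather than only those that pass the thresholds, is a conservative design choice made for this sketch.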
