☆ 4.7 Article

KvarQ: targeted and direct variant calling from fastq reads of bacterial genomes

BMC GENOMICS (2014)

期刊

BMC GENOMICS

卷 15, 期 -, 页码 -

出版社

BMC

DOI: 10.1186/1471-2164-15-881

关键词

Whole genome sequencing; FastQ; Single nucleotide polymorphisms; In-silico SNP-typing; Mycobacterium tuberculosis

类别

Biotechnology & Applied Microbiology Genetics & Heredity

资金

National Institutes of Health [AI090928]
Swiss National Science Foundation [PP00P3_150750]
European Research Council [309540-EVODRTB]
SystemsX.ch
Swiss National Science Foundation (SNF) [PP00P3_150750] Funding Source: Swiss National Science Foundation (SNF)

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Background: High-throughput DNA sequencing produces vast amounts of data, with millions of short reads that usually have to be mapped to a reference genome or newly assembled. Both reference-based mapping and de novo assembly are computationally intensive, generating large intermediary data files, and thus require bioinformatics skills that are often lacking in the laboratories producing the data. Moreover, many research and practical applications in microbiology require only a small fraction of the whole genome data. Results: We developed KvarQ, a new tool that directly scans fastq files of bacterial genome sequences for known variants, such as single nucleotide polymorphisms (SNP), bypassing the need of mapping all sequencing reads to a reference genome and de novo assembly. Instead, KvarQ loads testsuites that define specific SNPs or short regions of interest in a reference genome, and directly synthesizes the relevant results based on the occurrence of these markers in the fastq files. KvarQ has a versatile command line interface and a graphical user interface. KvarQ currently ships with two testsuites for Mycobacterium tuberculosis, but new testsuites for other organisms can easily be created and distributed. In this article, we demonstrate how KvarQ can be used to successfully detect all main drug resistance mutations and phylogenetic markers in 880 bacterial whole genome sequences. The average scanning time per genome sequence was two minutes. The variant calls of a subset of these genomes were validated with a standard bioinformatics pipeline and revealed >99% congruency. Conclusion: KvarQ is a user-friendly tool that directly extracts relevant information from fastq files. This enables researchers and laboratory technicians with limited bioinformatics expertise to scan and analyze raw sequencing data in a matter of minutes. KvarQ is open-source, and pre-compiled packages with a graphical user interface are available at http://www.swisstph.ch/kvarq.

KvarQ: targeted and direct variant calling from fastq reads of bacterial genomes

期刊

BMC GENOMICS

出版社

BMC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

KvarQ: targeted and direct variant calling from fastq reads of bacterial genomes

期刊

BMC GENOMICS

出版社

BMC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文