4.7 Article

In vitro and in silico parameters for precise cgMLST typing of Listeria monocytogenes

期刊

BMC GENOMICS
卷 23, 期 1, 页码 -

出版社

BMC
DOI: 10.1186/s12864-022-08437-4

关键词

cgMLST; Comparability of workflows; Listeria monocytogenes; Principal component analysis; Generalized linear model

资金

  1. European Joint Programme (EJP) dedicated to One Health Structure In Europe (COHESIVE) [773830]

向作者/读者索取更多资源

This study identified parameters influencing the precision of cgMLST profiles in Listeria monocytogenes, highlighting the impact of genetic background, cgMLST workflows, completeness, depth, and breadth of coverage on precision. All tested workflows performed well at >= 40X depth of coverage, showing consistent cluster definitions using a reference cut-off of <= 7 allele differences. The research suggests that bioinformatics workflows dedicated to cgMLST allele calling are robust when paired-end reads are of high quality and sequencing depth is sufficient.
Background Whole genome sequencing analyzed by core genome multi-locus sequence typing (cgMLST) is widely used in surveillance of the pathogenic bacteria Listeria monocytogenes. Given the heterogeneity of available bioinformatics tools to define cgMLST alleles, our aim was to identify parameters influencing the precision of cgMLST profiles. Methods We used three L. monocytogenes reference genomes from different phylogenetic lineages and assessed the impact of in vitro (i.e. tested genomes, successive platings, replicates of DNA extraction and sequencing) and in silico parameters (i.e. targeted depth of coverage, depth of coverage, breadth of coverage, assembly metrics, cgMLST workflows, cgMLST completeness) on cgMLST precision made of 1748 core loci. Six cgMLST workflows were tested, comprising assembly-based (BIGSdb, INNUENDO, GENPAT, SeqSphere and BioNumerics) and assembly-free (i.e. kmer-based MentaLiST) allele callers. Principal component analyses and generalized linear models were used to identify the most impactful parameters on cgMLST precision. Results The isolate's genetic background, cgMLST workflows, cgMLST completeness, as well as depth and breadth of coverage were the parameters that impacted most on cgMLST precision (i.e. identical alleles against reference circular genomes). All workflows performed well at >= 40X of depth of coverage, with high loci detection (> 99.54% for all, except for BioNumerics with 97.78%) and showed consistent cluster definitions using the reference cut-off of <= 7 allele differences. Conclusions This highlights that bioinformatics workflows dedicated to cgMLST allele calling are largely robust when paired-end reads are of high quality and when the sequencing depth is >= 40X.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据