4.5 Article

Defining genomic epidemiology thresholds for common-source bacterial outbreaks: a modelling study

期刊

LANCET MICROBE
卷 4, 期 5, 页码 e349-e357

出版社

ELSEVIER
DOI: 10.1016/S2666-5247(22)00380-9

关键词

-

向作者/读者索取更多资源

This study aimed to develop a hypothesis-based model to estimate genetic distance thresholds and mutation rates for single-strain food or environmental outbreaks. A forward model was developed to simulate bacterial evolution under specific mutation rates and outbreak durations. The model accurately discriminated outbreak and non-outbreak cases and estimated outbreak parameters. The model was validated using 16 published datasets of bacterial outbreaks.
Background Epidemiological surveillance relies on microbial strain typing, which defines genomic relatedness among isolates to identify case clusters and their potential sources. Although predefined thresholds are often applied, known outbreak-specific features such as pathogen mutation rate and duration of source contamination are rarely considered. We aimed to develop a hypothesis-based model that estimates genetic distance thresholds and mutation rates for point-source single-strain food or environmental outbreaks. Methods In this modelling study, we developed a forward model to simulate bacterial evolution at a specific mutation rate (mu) over a defined outbreak duration (D). From the distribution of genetic distances expected under the given outbreak parameters and sample isolation dates, we estimated a distance threshold beyond which isolates should not be considered as part of the outbreak. We embedded the model into a Markov Chain Monte Carlo inference framework to estimate the most probable mutation rate or time since source contamination, which are both often imprecisely documented. A simulation study validated the model over realistic durations and mutation rates. We then identified and analysed 16 published datasets of bacterial source-related outbreaks; datasets were included if they were from an identified foodborne outbreak and if whole-genome sequence data and collection dates for the described isolates were available. Findings Analysis of simulated data validated the accuracy of our framework in both discriminating between outbreak and non-outbreak cases and estimating the parameters D and mu from outbreak data. Precision of estimation was much higher for high values of D and mu. Sensitivity of outbreak cases was always very high, and specificity in detecting non-outbreak cases was poor for low mutation rates. For 14 of the 16 outbreaks, the classification of isolates as being outbreak-related or sporadic is consistent with the original dataset. Four of these outbreaks included outliers, which were correctly classified as being beyond the threshold of exclusion estimated by our model, except for one isolate of outbreak 4. For two outbreaks, both foodborne Listeria monocytogenes, conclusions from our model were discordant with published results: in one outbreak two isolates were classified as outliers by our model and in another outbreak our algorithm separated food samples into one cluster and human samples into another, whereas the isolates were initially grouped together based on epidemiological and genetic evidence. Re-estimated values of the duration of outbreak or mutation rate were largely consistent with a priori defined values. However, in several cases the estimated values were higher and improved the fit with the observed genetic distance distribution, suggesting that early outbreak cases are sometimes missed. Interpretation We propose here an evolutionary approach to the single-strain conundrum by estimating the genetic threshold and proposing the most probable cluster of cases for a given outbreak, as determined by its particular epidemiological and microbiological properties. This forward model, applicable to foodborne or environmental -source single point case clusters or outbreaks, is useful for epidemiological surveillance and may inform control measures.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据