4.5 Article

A Machine Learning Model for Food Source Attribution of Listeria monocytogenes

期刊

PATHOGENS
卷 11, 期 6, 页码 -

出版社

MDPI
DOI: 10.3390/pathogens11060691

关键词

Listeria monocytogenes; food source attribution; whole-genome sequencing; machine learning; predictive modeling

向作者/读者索取更多资源

Machine learning models were used to predict the food source of clinical Listeria monocytogenes isolates, with the logit boost algorithm performing the best. The model identified dairy, fruits, leafy greens as the main sources of human clinical cases of L. monocytogenes, and provided predictive genetic features for specific sources.
Despite its low morbidity, listeriosis has a high mortality rate due to the severity of its clinical manifestations. The source of human listeriosis is often unclear. In this study, we investigate the ability of machine learning to predict the food source from which clinical Listeria monocytogenes isolates originated. Four machine learning classification algorithms were trained on core genome multilocus sequence typing data of 1212 L. monocytogenes isolates from various food sources. The average accuracies of random forest, support vector machine radial kernel, stochastic gradient boosting, and logit boost were found to be 0.72, 0.61, 0.7, and 0.73, respectively. Logit boost showed the best performance and was used in model testing on 154 L. monocytogenes clinical isolates. The model attributed 17.5% of human clinical cases to dairy, 32.5% to fruits, 14.3% to leafy greens, 9.7% to meat, 4.6% to poultry, and 18.8% to vegetables. The final model also provided us with genetic features that were predictive of specific sources. Thus, this combination of genomic data and machine learning-based models can greatly enhance our ability to track L. monocytogenes from different food sources.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据