Article

Using Machine Learning Techniques and National Tuberculosis Surveillance Data to Predict Excess Growth in Genotyped Tuberculosis Clusters

Journal

AMERICAN JOURNAL OF EPIDEMIOLOGY
Volume 191, Issue 11, Pages 1936-1943

Publisher

OXFORD UNIV PRESS INC
DOI: 10.1093/aje/kwac117

Keywords

cluster growth; machine learning; surveillance data; tuberculosis

Funding

  1. CDC's Division of Tuberculosis Elimination

This study demonstrates the use of surveillance data, statistical definitions, and machine learning to predict clusters of tuberculosis cases that are likely to grow and become outbreaks, providing an opportunity for intervention and prevention.
The early identification of clusters of persons with tuberculosis (TB) that will grow to become outbreaks creates an opportunity for intervention in preventing future TB cases. We used surveillance data (2009-2018) from the United States, statistically derived definitions of unexpected growth, and machine-learning techniques to predict which clusters of genotype-matched TB cases are most likely to continue accumulating cases above expected growth within a 1-year follow-up period. We developed a model on training and testing data sets to predict which clusters are likely to grow, and the model generalized to a validation data set. Our model showed that characteristics of clusters were more important than the social, demographic, and clinical characteristics of the patients in those clusters. For instance, the time between cases before unexpected growth was identified as the most important of our predictors. A faster accumulation of cases increased the probability of excess growth being predicted during the follow-up period. We have demonstrated that combining the characteristics of clusters and cases with machine learning can add to existing tools to help prioritize which clusters may benefit most from public health interventions. For example, consideration of an entire cluster, not only an individual patient, may assist in interrupting ongoing transmission.
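
The abstract names the time between cases as the most important predictor. As a rough illustration only, the sketch below shows one way such a cluster-level feature might be computed from case dates. The function names, cluster labels, and dates are hypothetical and are not taken from the study, and the sketch does not reproduce the authors' definition of unexpected growth or their model.

```python
from datetime import date
from statistics import mean


def days_between_cases(case_dates):
    """Return the gaps, in days, between consecutive cases in one cluster."""
    ordered = sorted(case_dates)
    return [(later - earlier).days for earlier, later in zip(ordered, ordered[1:])]


def mean_gap_days(case_dates):
    """Mean gap in days between consecutive cases; None if fewer than two cases."""
    gaps = days_between_cases(case_dates)
    return mean(gaps) if gaps else None


# Hypothetical clusters with made-up dates, for illustration only.
clusters = {
    "cluster_a": [date(2015, 1, 10), date(2015, 2, 9), date(2015, 3, 11)],
    "cluster_b": [date(2015, 1, 10), date(2015, 7, 9), date(2016, 1, 5)],
}

# Rank clusters so the fastest-accumulating one (smallest mean gap) comes first.
ranked = sorted(clusters, key=lambda name: mean_gap_days(clusters[name]))
```

In a fuller pipeline, a feature like this would be one input among many to a trained classifier. Here it only shows that a shorter mean gap corresponds to faster case accumulation.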
