Journal
AMERICAN JOURNAL OF EPIDEMIOLOGY
Volume 191, Issue 11, Pages 1936-1943
Publisher
OXFORD UNIV PRESS INC
DOI: 10.1093/aje/kwac117
Keywords
cluster growth; machine learning; surveillance data; tuberculosis
Funding
- CDC's Division of Tuberculosis Elimination
This study demonstrates the use of surveillance data, statistical definitions, and machine learning to predict clusters of tuberculosis cases that are likely to grow and become outbreaks, providing an opportunity for intervention and prevention.
The early identification of clusters of persons with tuberculosis (TB) that will grow to become outbreaks creates an opportunity to intervene and prevent future TB cases. We used surveillance data (2009-2018) from the United States, statistically derived definitions of unexpected growth, and machine-learning techniques to predict which clusters of genotype-matched TB cases are most likely to continue accumulating cases above expected growth within a 1-year follow-up period. Using a training and testing data set, we developed a model to predict which clusters are likely to grow; the model generalized to a validation data set. Our model showed that characteristics of clusters were more important than the social, demographic, and clinical characteristics of the patients in those clusters. For instance, the time between cases before unexpected growth was identified as the most important of our predictors: a faster accumulation of cases increased the predicted probability of excess growth during the follow-up period. We have demonstrated that combining the characteristics of clusters and cases with machine learning can add to existing tools to help prioritize which clusters may benefit most from public health interventions. For example, consideration of an entire cluster, not only an individual patient, may assist in interrupting ongoing transmission.
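
The two cluster-level ideas in the abstract, the time between cases and growth above what is expected, can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not state how unexpected growth was defined, so the Poisson upper-tail test, the `alpha` threshold, the expected count, the function names, and the example dates below are all hypothetical assumptions chosen for illustration.

```python
from datetime import date
from math import exp, factorial


def mean_days_between_cases(case_dates):
    """Average gap in days between consecutive cases in one cluster."""
    ordered = sorted(case_dates)
    gaps = [(later - earlier).days for earlier, later in zip(ordered, ordered[1:])]
    return sum(gaps) / len(gaps) if gaps else None


def poisson_upper_tail(observed, expected):
    """P(X >= observed) for X ~ Poisson(expected)."""
    below = sum(exp(-expected) * expected**k / factorial(k) for k in range(observed))
    return 1.0 - below


def has_unexpected_growth(observed, expected, alpha=0.05):
    """Assumed rule: flag a cluster when its observed case count is
    improbably high under a Poisson model with the expected count."""
    return poisson_upper_tail(observed, expected) < alpha


# Hypothetical cluster: four genotype-matched cases (made-up dates).
cluster = [date(2015, 1, 10), date(2015, 3, 1), date(2015, 3, 20), date(2015, 4, 2)]
gap = mean_days_between_cases(cluster)

# Hypothetical expectation of 1 case over the same window.
flag = has_unexpected_growth(observed=len(cluster), expected=1.0)
```

In a pipeline of this kind, a feature such as `gap` would be one of several cluster-level inputs to a classifier, and a rule like `has_unexpected_growth` would supply the outcome label for the follow-up period.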