4.7 Article

Chest X-ray Analysis With Deep Learning-Based Software as a Triage Test for Pulmonary Tuberculosis: An Individual Patient Data Meta-Analysis of Diagnostic Accuracy

Journal

CLINICAL INFECTIOUS DISEASES
Volume 74, Issue 8, Pages 1390-1400

Publisher

OXFORD UNIV PRESS INC
DOI: 10.1093/cid/ciab639

Keywords

tuberculosis; chest X-ray; deep learning; individual patient data meta-analysis; accuracy

Funding

  1. L'Observatoire International Sur Les Impacts Societaux de l'Intelligence Artificielle (Fonds de recherche Quebec)


This study found that the accuracy of commercially available deep learning-based CAD for detecting tuberculosis varied between different populations, suggesting the need for tailored application based on specific patient characteristics.
Background: Automated radiologic analysis using computer-aided detection software (CAD) could facilitate chest X-ray (CXR) use in tuberculosis diagnosis. There is little to no evidence on the accuracy of commercially available deep learning-based CAD in different populations, including patients with smear-negative tuberculosis and people living with human immunodeficiency virus (HIV), referred to here as PLWH.
Methods: We collected CXRs and individual patient data (IPD) from studies evaluating CAD in patients self-referring for tuberculosis symptoms, with culture or nucleic acid amplification testing as the reference. We reanalyzed CXRs with three CAD programs (CAD4TB version (v) 6, Lunit v3.1.0.0, and qXR v2). We estimated sensitivity and specificity within each study and pooled them using IPD meta-analysis. We used multivariable meta-regression to identify characteristics modifying accuracy.
Results: We included CXRs and IPD of 3727/3967 participants from 4/7 eligible studies. 17% (621/3727) were PLWH. 17% (645/3727) had microbiologically confirmed tuberculosis. Despite using the same threshold score for classifying CXR in every study, sensitivity and specificity varied from study to study. The software had similar unadjusted accuracy (at 90% pooled sensitivity, pooled specificities were: CAD4TBv6, 56.9% [95% confidence interval {CI}: 51.7-61.9]; Lunit, 54.1% [95% CI: 44.6-63.3]; qXRv2, 60.5% [95% CI: 51.7-68.6]). Adjusted absolute differences in pooled sensitivity between PLWH and HIV-uninfected participants were: CAD4TBv6, -13.4% [-21.1, -6.9]; Lunit, +2.2% [-3.6, +6.3]; qXRv2, -13.4% [-21.5, -6.6]. Adjusted absolute differences between smear-negative and smear-positive tuberculosis were: CAD4TBv6, -12.3% [-19.5, -6.1]; Lunit, -17.2% [-24.6, -10.5]; qXRv2, -16.6% [-24.4, -9.9]. Accuracy was similar to that of human readers.
Conclusions: For CAD CXR analysis to be implemented as a high-sensitivity tuberculosis rule-out test, users will need threshold scores identified from their own patient populations and stratified by HIV and smear status.
An individual patient data (IPD) meta-analysis found that the accuracy of commercially available deep learning-based chest X-ray analysis software for detecting tuberculosis varied between studies and by patient characteristics. Diagnostic heterogeneity poses an implementation challenge for this novel technology.
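The point about population-specific threshold scores can be illustrated with a minimal Python sketch. It is not the authors' analysis code and does not reproduce the IPD meta-analysis or meta-regression. The function names, the convention that a score at or above the threshold counts as CAD-positive, and the two small score lists are all hypothetical and chosen only to show the mechanics: a threshold that reaches a target sensitivity in one population can give a different sensitivity in another.

```python
from typing import List, Tuple


def sensitivity_specificity(
    scores: List[float], labels: List[int], threshold: float
) -> Tuple[float, float]:
    """Return (sensitivity, specificity) at a given threshold score.

    Assumed convention: score >= threshold is CAD-positive;
    label 1 = reference-standard tuberculosis, label 0 = no tuberculosis.
    """
    tp = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= threshold)
    fn = sum(1 for s, y in zip(scores, labels) if y == 1 and s < threshold)
    tn = sum(1 for s, y in zip(scores, labels) if y == 0 and s < threshold)
    fp = sum(1 for s, y in zip(scores, labels) if y == 0 and s >= threshold)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative case")
    return tp / (tp + fn), tn / (tn + fp)


def threshold_for_sensitivity(
    scores: List[float], labels: List[int], target: float
) -> float:
    """Return the highest observed score that, used as threshold, meets the target sensitivity.

    Sensitivity can only fall as the threshold rises, so the highest
    qualifying threshold also gives the best specificity among those
    that meet the target.
    """
    for t in sorted(set(scores), reverse=True):
        sens, _ = sensitivity_specificity(scores, labels, t)
        if sens >= target:
            return t
    raise ValueError("no threshold reaches the target sensitivity")


# Hypothetical toy data, NOT taken from the study.
scores_a = [0.9, 0.8, 0.7, 0.4, 0.6, 0.3, 0.2, 0.1]
labels_a = [1, 1, 1, 1, 0, 0, 0, 0]
scores_b = [0.8, 0.5, 0.45, 0.3, 0.6, 0.5, 0.2, 0.1]
labels_b = [1, 1, 1, 1, 0, 0, 0, 0]

# Derive a threshold in population A, then apply it unchanged to population B.
t_a = threshold_for_sensitivity(scores_a, labels_a, target=0.75)
print("threshold from A:", t_a)
print("A (sens, spec):", sensitivity_specificity(scores_a, labels_a, t_a))
print("B (sens, spec):", sensitivity_specificity(scores_b, labels_b, t_a))
```

In this toy setup the threshold derived from population A meets the target there but gives a lower sensitivity when reused in population B. The abstract describes the same kind of between-population variation when recommending that thresholds be identified locally and stratified by HIV and smear status.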
