4.6 Article

Generalization Challenges in Drug-Resistant Tuberculosis Detection from Chest X-rays

期刊

DIAGNOSTICS
卷 12, 期 1, 页码 -

出版社

MDPI
DOI: 10.3390/diagnostics12010188

关键词

Tuberculosis (TB); drug resistance; deep learning; chest X-rays; generalization; localization

资金

  1. Office of the Secretary Patient-Centered Outcomes Research Trust Fund (OS-PCORTF) [750119PE080057]
  2. Intramural Research Program of the National Library of Medicine, National Institutes of Health
  3. National Institute of Allergy and Infectious Diseases under BCBB Support Services [HHSN316201300006W/HHSN27200002]

向作者/读者索取更多资源

The classification of drug-resistant tuberculosis (DR-TB) and drug-sensitive tuberculosis (DS-TB) from chest radiographs is still an unresolved problem. Previous performance on publicly available chest X-ray data achieved 85% AUC with a deep convolutional neural network (CNN), but significant performance degradation was observed when applied to unseen data. In this paper, the generalizability of the models on images from a different country's dataset is investigated, and the lack of good generalization is explored. Comparisons between radiologist-annotated lesion locations and the trained model's localization using GradCAM show little overlap.
Classification of drug-resistant tuberculosis (DR-TB) and drug-sensitive tuberculosis (DS-TB) from chest radiographs remains an open problem. Our previous cross validation performance on publicly available chest X-ray (CXR) data combined with image augmentation, the addition of synthetically generated and publicly available images achieved a performance of 85% AUC with a deep convolutional neural network (CNN). However, when we evaluated the CNN model trained to classify DR-TB and DS-TB on unseen data, significant performance degradation was observed (65% AUC). Hence, in this paper, we investigate the generalizability of our models on images from a held out country's dataset. We explore the extent of the problem and the possible reasons behind the lack of good generalization. A comparison of radiologist-annotated lesion locations in the lung and the trained model's localization of areas of interest, using GradCAM, did not show much overlap. Using the same network architecture, a multi-country classifier was able to identify the country of origin of the X-ray with high accuracy (86%), suggesting that image acquisition differences and the distribution of non-pathological and non-anatomical aspects of the images are affecting the generalization and localization of the drug resistance classification model as well. When CXR images were severely corrupted, the performance on the validation set was still better than 60% AUC. The model overfitted to the data from countries in the cross validation set but did not generalize to the held out country. Finally, we applied a multi-task based approach that uses prior TB lesions location information to guide the classifier network to focus its attention on improving the generalization performance on the held out set from another country to 68% AUC.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据