4.6 Article

Deep learning on digital mammography for expert-level diagnosis accuracy in breast cancer detection

期刊

MULTIMEDIA SYSTEMS
卷 28, 期 4, 页码 1263-1274

出版社

SPRINGER
DOI: 10.1007/s00530-021-00823-4

关键词

Deep learning; Digital mammography; Breast cancer; Diagnostic accuracy; Artificial intelligence

资金

  1. National Natural Science Foundation of China [61702453]
  2. Natural Science Foundation of Zhejiang Province [Q17F030003]

向作者/读者索取更多资源

This study demonstrates that a deep learning classification model trained on historical mammograms can achieve excellent diagnostic performance on new exams, outperforming radiologists in breast cancer diagnosis. The DL model can accurately localize lesions on mammograms even without specific lesion information during training. Furthermore, the impact of input image resolution and different DL model architectures on diagnostic accuracy was analyzed.
Recently, computer-aided diagnosis (CAD) systems powered by deep learning (DL) algorithms have shown excellent performance in the evaluation of digital mammography for breast cancer diagnosis. However, such systems typically require pixel-level annotations by expert radiologists which is prohibitively time-consuming and expensive. Medical institutes would wonder if a high-performance breast cancer CAD system can be trained by exploring their own huge amount of historical imaging data and corresponding diagnosis reports, without additional annotations workload of their radiologists. In this study, we show that a DL classification model trained on historical mammograms with only image-level pathology labels (which can be automatically extracted from medical reports) can achieve surprisingly good diagnostic performance on newly incoming exams compared with experienced radiologists. A DL model called DenseNet was trained and cross-validated with 5979 historical exams acquired before September 2017 with biopsy-verified pathology and tested with 1194 newly obtained cases after that. For both cross-validation and test sets, the ROCs generated by DL predictions were above the ROCs generated by ratings from radiologists. For the suspicious cases which radiologists suggest biopsy (BI-RADS category 4 and 5), the DL model can reject 60% of false biopsies on benign breasts while keeping 95% sensitivity. For the mammograms based on which radiologists were not able to make a diagnosis (BI-RADS 0), the DL model still achieved an AUC score of 79%. Moreover, the model is able to localize lesions on mammograms although such information was not provided in the training phase. Finally, the impact of input image resolution and different DL model architectures on the diagnostic accuracy were also presented and analyzed.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据