4.6 Article Proceedings Paper

General deep learning model for detecting diabetic retinopathy

期刊

BMC BIOINFORMATICS
卷 22, 期 SUPPL 5, 页码 -

出版社

BMC
DOI: 10.1186/s12859-021-04005-x

关键词

SMOTE; Overfitting; Decision tree; Nasnet-large; Transfer learning

资金

  1. Ministry of Science Technology, Taiwan [MOST 108-3111-Y-016-012]

向作者/读者索取更多资源

This study tackles the issue of overfitting in deep learning for diabetic retinopathy diagnosis by using a 2-stage training method and employing techniques like SMOTE synthetic datasets and early stopping to reduce overfitting. The developed general deep learning model for detecting DR achieved good prediction accuracy on multiple datasets, providing a simple approach to address database imbalances in medical images.
Background Doctors can detect symptoms of diabetic retinopathy (DR) early by using retinal ophthalmoscopy, and they can improve diagnostic efficiency with the assistance of deep learning to select treatments and support personnel workflow. Conventionally, most deep learning methods for DR diagnosis categorize retinal ophthalmoscopy images into training and validation data sets according to the 80/20 rule, and they use the synthetic minority oversampling technique (SMOTE) in data processing (e.g., rotating, scaling, and translating training images) to increase the number of training samples. Oversampling training may lead to overfitting of the training model. Therefore, untrained or unverified images can yield erroneous predictions. Although the accuracy of prediction results is 90%-99%, this overfitting of training data may distort training module variables. Results This study uses a 2-stage training method to solve the overfitting problem. In the training phase, to build the model, the Learning module 1 used to identify the DR and no-DR. The Learning module 2 on SMOTE synthetic datasets to identify the mild-NPDR, moderate NPDR, severe NPDR and proliferative DR classification. These two modules also used early stopping and data dividing methods to reduce overfitting by oversampling. In the test phase, we use the DIARETDB0, DIARETDB1, eOphtha, MESSIDOR, and DRIVE datasets to evaluate the performance of the training network. The prediction accuracy achieved to 85.38%, 84.27%, 85.75%, 86.73%, and 92.5%. Conclusions Based on the experiment, a general deep learning model for detecting DR was developed, and it could be used with all DR databases. We provided a simple method of addressing the imbalance of DR databases, and this method can be used with other medical images.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据