期刊
JOURNAL OF CLINICAL MEDICINE
卷 11, 期 11, 页码 -出版社
MDPI
DOI: 10.3390/jcm11113125
关键词
grader comparison; diabetic retinopathy grading; deep learning; inter-grader variability; diabetic retinopathy screening
Poland lacks a widespread diabetic retinopathy (DR) screening program and trained graders. This study compared the performance of three retinal specialists with no additional DR grading training in assessing real-life screening images and their performance against IDx-DR. The results showed that the retina specialists had high grader variability and limited concordance with IDx-DR results.
Poland has never had a widespread diabetic retinopathy (DR) screening program and subsequently has no purpose-trained graders and no established grader training scheme. Herein, we compare the performance and variability of three retinal specialists with no additional DR grading training in assessing images from 335 real-life screening encounters and contrast their performance against IDx-DR, a US Food and Drug Administration (FDA) approved DR screening suite. A total of 1501 fundus images from 670 eyes were assessed by each grader with a final grade on a per-eye level. Unanimous agreement between all graders was achieved for 385 eyes, and 110 patients, out of which 98% had a final grade of no DR. Thirty-six patients had final grades higher than mild DR, out of which only two had no grader disagreements regarding severity. A total of 28 eyes underwent adjudication due to complete grader disagreement. Four patients had discordant grades ranging from no DR to severe DR between the human graders and IDx-DR. Retina specialists achieved kappa scores of 0.52, 0.78, and 0.61. Retina specialists had relatively high grader variability and only a modest concordance with IDx-DR results. Focused training and verification are recommended for any potential DR graders before assessing DR screening images.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据