Article

Artificial Intelligence for Retinopathy of Prematurity: Validation of a Vascular Severity Scale against International Expert Diagnosis

Journal

OPHTHALMOLOGY
Volume 129, Issue 7, Pages E69-E76

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ophtha.2022.02.008

Keywords

Artificial intelligence; Deep learning; Disease classification; Interobserver agreement; Retinopathy of prematurity; Severity score

Funding

  1. National Institutes of Health (Bethesda, MD) [R01EY19474, R01 EY031331, P30EY10572]
  2. Research to Prevent Blindness (New York, NY)

This study validated a vascular severity score as an appropriate output for ROP SaMD by comparing it with the labels for stage and plus disease assigned by ICROP3 committee members. The results showed a high correlation between the vascular severity score and the diagnostic labels for plus disease and stage.
Purpose: To validate a vascular severity score as an appropriate output for artificial intelligence (AI) Software as a Medical Device (SaMD) for retinopathy of prematurity (ROP) through comparison with ordinal disease severity labels for stage and plus disease assigned by the International Classification of Retinopathy of Prematurity, Third Edition (ICROP3), committee.
Design: Validation study of an AI-based ROP vascular severity score.
Participants: A total of 34 ROP experts from the ICROP3 committee.
Methods: Two separate datasets of 30 fundus photographs each for stage (0-5) and plus disease (plus, preplus, neither) were labeled by members of the ICROP3 committee using an open-source platform. Averaging these results produced a continuous label for plus (1-9) and stage (1-3) for each image. Experts were also asked to compare each image to each other in terms of relative severity for plus disease. Each image was also labeled with a vascular severity score from the Imaging and Informatics in ROP deep learning system, which was compared with each grader's diagnostic labels for correlation, as well as the ophthalmoscopic diagnosis of stage.
Main Outcome Measures: Weighted kappa and Pearson correlation coefficients (CCs) were calculated between each pair of grader classification labels for stage and plus disease. The Elo algorithm was also used to convert pairwise comparisons for each expert into an ordered set of images from least to most severe.
Results: The mean weighted kappa and CC for all interobserver pairs for plus disease image comparison were 0.67 and 0.88, respectively. The vascular severity score was found to be highly correlated with both the average plus disease classification (CC = 0.90, P < 0.001) and the ophthalmoscopic diagnosis of stage (P < 0.001 by analysis of variance) among all experts.
Conclusions: The ROP vascular severity score correlates well with the International Classification of Retinopathy of Prematurity committee members' labels for plus disease and stage, which had significant intergrader variability. Generation of a consensus for a validated scoring system for ROP SaMD can facilitate global innovation and regulatory authorization of these technologies. (c) 2022 Published by Elsevier Inc. on behalf of the American Academy of Ophthalmology
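
The three outcome measures named in the abstract (weighted kappa, Pearson correlation, and Elo ranking from pairwise comparisons) can be illustrated with a minimal Python sketch. This is not the study's code. The abstract does not state the kappa weighting scheme or any Elo parameters, so the linear weights, the K-factor of 32, the starting rating of 1500, and all function names below are assumptions chosen for illustration.

```python
from itertools import product
from math import sqrt


def weighted_kappa(a, b, n_categories):
    """Linearly weighted Cohen's kappa for two graders' ordinal labels.

    Labels are integers in 0..n_categories-1. Linear weights are an
    assumption; the abstract does not say which weighting was used.
    Undefined (division by zero) if both graders give one constant label.
    """
    n = len(a)
    # Observed joint distribution of the two graders' labels.
    observed = [[0.0] * n_categories for _ in range(n_categories)]
    for x, y in zip(a, b):
        observed[x][y] += 1.0 / n
    row = [sum(observed[i]) for i in range(n_categories)]
    col = [sum(observed[i][j] for i in range(n_categories))
           for j in range(n_categories)]
    disagree_obs = disagree_exp = 0.0
    for i, j in product(range(n_categories), repeat=2):
        w = abs(i - j) / (n_categories - 1)  # linear disagreement weight
        disagree_obs += w * observed[i][j]
        disagree_exp += w * row[i] * col[j]  # expected under independence
    return 1.0 - disagree_obs / disagree_exp


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((p - mx) * (q - my) for p, q in zip(x, y))
    var_x = sum((p - mx) ** 2 for p in x)
    var_y = sum((q - my) ** 2 for q in y)
    return cov / sqrt(var_x * var_y)


def elo_rank(n_images, comparisons, k=32.0, start=1500.0):
    """Order images from least to most severe using Elo updates.

    comparisons: (more_severe, less_severe) index pairs from one grader.
    k and start are conventional Elo defaults, assumed here.
    """
    rating = [start] * n_images
    for winner, loser in comparisons:
        # Probability the "winner" was expected to be judged more severe.
        expected = 1.0 / (1.0 + 10 ** ((rating[loser] - rating[winner]) / 400.0))
        delta = k * (1.0 - expected)
        rating[winner] += delta
        rating[loser] -= delta
    return sorted(range(n_images), key=lambda i: rating[i])


# Toy data (hypothetical, not from the study): 0 = neither, 1 = preplus, 2 = plus.
grader_a = [0, 1, 2, 2]
grader_b = [0, 1, 1, 2]
kappa = weighted_kappa(grader_a, grader_b, n_categories=3)
cc = pearson(grader_a, grader_b)
order = elo_rank(3, [(2, 1), (1, 0), (2, 0)])
```

In a setup like the one described, the kappa and correlation functions would be applied to every pair of graders and averaged, while the Elo ranking would be computed separately for each expert's set of pairwise comparisons.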
