4.6 Article

Quantification of Key Retinal Features in Early and Late Age-Related Macular Degeneration Using Deep Learning

Journal

AMERICAN JOURNAL OF OPHTHALMOLOGY
Volume 226, Issue -, Pages 1-12

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ajo.2020.12.034

Keywords

-

Categories

Funding

  1. Novartis Pharmaceuticals
  2. Automation in Medical Imaging project
  3. Fraunhofer Gesellschaft
  4. Radboud University
  5. University Medical Center
  6. German Research Foundation [MU4279/2-1]

Ask authors/readers for more resources

The study aimed to develop and validate a deep learning model for segmentation of 13 features associated with age-related macular degeneration. The results showed that the model's performance matches that of experienced observers for most features, and even exceeds human performance for some features.
PURPOSE: We sought to develop and validate a deep learning model for segmentation of 13 features associated with neovascular and atrophic age-related macular degeneration (AMD). DESIGN: Development and validation of a deep-learning model for feature segmentation. METHODS: Data for model development were obtained from 307 optical coherence tomography volumes. Eight experienced graders manually delineated all abnormalities in 2712 B-scans. A deep neural network was trained with these data to perform voxel-level segmentation of the 13 most common abnormalities (features). For evaluation, 112 B-scans from 112 patients with a diagnosis of neovascular AMD were annotated by 4 independent observers. The main outcome measures were Dice score, intraclass correlation coefficient, and free-response receiver operating characteristic curve. RESULTS: On 11 of 13 features, the model obtained a mean Dice score of 0.63 +/- 0.15, compared with 0.61 +/- 0.17 for the observers. The mean intraclass correlation coefficient for the model was 0.66 +/- 0.22, compared with 0.62 +/- 0.21 for the observers. Two features were not evaluated quantitatively because of a lack of data. Free-response receiver operating characteristic analysis demonstrated that the model scored similar or higher sensitivity per false positives compared with the observers. CONCLUSIONS: The quality of the automatic segmentation matches that of experienced graders for most features, exceeding human performance for some features. The quantified parameters provided by the model can be used in the current clinical routine and open possibilities for further research into treatment response outside clinical trials. ((C) 2021 The Authors. Published by Elsevier Inc.)

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available