4.5 Article

A deep learning tool for fully automated measurements of sagittal spinopelvic balance from X-ray images: performance evaluation

Journal

EUROPEAN SPINE JOURNAL
Volume 29, Issue 9, Pages 2295-2305

Publisher

SPRINGER
DOI: 10.1007/s00586-020-06406-7

Keywords

Sagittal spinopelvic balance; Pelvic incidence; X-ray images; Computer-assisted tools; Deep learning

Funding

  1. Raylytic GmbH, Leipzig, Germany
  2. Slovenian Research Agency [P2-0232, J2-7118]
  3. German Research Foundation (DFG) [PU 510/2-1]

Ask authors/readers for more resources

Purpose The purpose of this study is to evaluate the performance of a novel deep learning (DL) tool for fully automated measurements of the sagittal spinopelvic balance from X-ray images of the spine in comparison with manual measurements. Methods Ninety-seven conventional upright sagittal X-ray images from 55 subjects were retrospectively included in this study. Measurements of the parameters of the sagittal spinopelvic balance, i.e., the sacral slope (SS), pelvic tilt (PT), spinal tilt (ST), pelvic incidence (PI) and spinosacral angle (SSA), were obtained manually by identifying specific anatomical landmarks using the SurgiMap Spine software and by the fully automated DL tool. Statistical analysis was performed in terms of the mean absolute difference (MAD), standard deviation (SD) and Pearson correlation, while the paired t test was used to search for statistically significant differences between manual and automated measurements. Results The differences between reference manual measurements and those obtained automatically by the DL tool were, respectively, for SS, PT, ST, PI and SSA, equal to 5.0 degrees (3.4 degrees), 2.7 degrees (2.5 degrees), 1.2 degrees (1.2 degrees), 5.5 degrees (4.2 degrees) and 5.0 degrees (3.5 degrees) in terms of MAD (SD), with a statistically significant corresponding Pearson correlation of 0.73, 0.90, 0.95, 0.81 and 0.71. No statistically significant differences were observed between the two types of measurement (p value always above 0.05). Conclusion The differences between measurements are in the range of the observer variability of manual measurements, indicating that the DL tool can provide clinically equivalent measurements in terms of accuracy but superior measurements in terms of cost-effectiveness, reliability and reproducibility.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available