4.3 Article

Practical quantification of image registration accuracy following the AAPM TG-132 report framework

Journal

JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS
Volume 19, Issue 4, Pages 125-133

Publisher

WILEY
DOI: 10.1002/acm2.12348

Keywords

deformable image registration; rigid image registration; validation of image registration

Ask authors/readers for more resources

The AAPM TG 132 Report enumerates important steps for validation of the medical image registration process. While the Report outlines the general goals and criteria for the tests, specific implementation may be obscure to the wider clinical audience. We endeavored to provide a detailed step-by-step description of the quantitative tests' execution, applied as an example to a commercial software package (Mirada Medical, Oxford, UK), while striving for simplicity and utilization of readily available software. We demonstrated how the rigid registration data could be easily extracted from the DICOM registration object and used, following some simple matrix math, to quantify accuracy of rigid translations and rotations. The options for validating deformable image registration (DIR) were enumerated, and it was shown that the most practically viable ones are comparison of propagated internal landmark points on the published datasets, or of segmented contours that can be generated locally. The multimodal rigid registration in our example did not always result in the desired registration error below 1/2 voxel size, but was considered acceptable with the maximum errors under 1.3 mm and 1 degrees. The DIR target registration errors in the thorax based on internal landmarks were far in excess of the Report recommendations of 2 mm average and 5 mm maximum. On the other hand, evaluation of the DIR major organs' contours propagation demonstrated good agreement for lung and abdomen (Dice Similarity Coefficients, DSC, averaged over all cases and structures of 0.92 +/- 0.05 and 0.91 +/- 0.06, respectively), and fair agreement for Head and Neck (average DSC = 0.73 +/- 0.14). The average for head and neck is reduced by small volume structures such as pharyngeal constrictor muscles. Even these relatively simple tests show that commercial registration algorithms cannot be automatically assumed sufficiently accurate for all applications. Formalized task-specific accuracy quantification should be expected from the vendors.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available