☆ 4.7 Article

Reverse Classification Accuracy: Predicting Segmentation Performance in the Absence of Ground Truth

IEEE TRANSACTIONS ON MEDICAL IMAGING (2017)

Journal

IEEE TRANSACTIONS ON MEDICAL IMAGING

Volume 36, Issue 8, Pages 1597-1606

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TMI.2017.2665165

Keywords

Abdominal; classification; image segmentation; machine learning; MRI; performance evaluation

Funding

NVIDIA Corporation
EPSRC [EP/N023668/1] Funding Source: UKRI
Engineering and Physical Sciences Research Council [EP/N023668/1] Funding Source: researchfish
National Institute for Health Research [13/122/01] Funding Source: researchfish

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

When integrating computational tools, such as automatic segmentation, into clinical practice, it is of utmost importance to be able to assess the level of accuracy on new data and, in particular, to detect when an automatic method fails. However, this is difficult to achieve due to the absence of ground truth. Segmentation accuracy on clinical data might be different from what is found through cross validation, because validation data are often used during incremental method development, which can lead to overfitting and unrealistic performance expectations. Before deployment, performance is quantified using different metrics, for which the predicted segmentation is comparedwith a reference segmentation, often obtained manually by an expert. But little is known about the real performance after deployment when a reference is unavailable. In this paper, we introduce the concept of reverse classification accu-racy (RCA) as a framework for predicting the performance of a segmentation method on new data. In RCA, we take the predicted segmentation from a new image to train a reverse classifier, which is evaluated on a set of reference images with available ground truth. The hypothesis is that if the predicted segmentation is of good quality, then the reverse classifier will perform well on at least some of the reference images. We validate our approach on multi-organ segmentation with different classifiers and segmentation methods. Our results indicate that it is indeed possible to predict the quality of individual segmentations, in the absence of ground truth. Thus, RCA is ideal for integration into automatic processing pipelines in clinical routine and as a part of large-scale image analysis studies.

Reverse Classification Accuracy: Predicting Segmentation Performance in the Absence of Ground Truth

Journal

IEEE TRANSACTIONS ON MEDICAL IMAGING

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Reverse Classification Accuracy: Predicting Segmentation Performance in the Absence of Ground Truth

Journal

IEEE TRANSACTIONS ON MEDICAL IMAGING

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper