4.7 Article

Latent Transformer Models for out-of-distribution detection

期刊

MEDICAL IMAGE ANALYSIS
卷 90, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.media.2023.102967

关键词

Transformers; Out-of-distribution detection; Segmentation; Uncertainty

向作者/读者索取更多资源

In this study, several segmentation methods with uncertainty were evaluated for the task of segmenting bleeds in 3D CT of the head. The results showed that these models can fail catastrophically in the far out-of-distribution domain, often providing highly confident but incorrect predictions. A method using a latent transformer model for out-of-distribution detection was proposed, which could identify images that are both far and near out-of-distribution, as well as provide spatial maps highlighting the regions considered to be out-of-distribution. Furthermore, a strong relationship between an image's likelihood and the quality of a model's segmentation on it was found, demonstrating the viability of this approach for filtering out unsuitable images.
Any clinically-deployed image-processing pipeline must be robust to the full range of inputs it may be presented with. One popular approach to this challenge is to develop predictive models that can provide a measure of their uncertainty. Another approach is to use generative modelling to quantify the likelihood of inputs. Inputs with a low enough likelihood are deemed to be out-of-distribution and are not presented to the downstream predictive model. In this work, we evaluate several approaches to segmentation with uncertainty for the task of segmenting bleeds in 3D CT of the head. We show that these models can fail catastrophically when operating in the far out-of-distribution domain, often providing predictions that are both highly confident and wrong. We propose to instead perform out-of-distribution detection using the Latent Transformer Model: a VQ-GAN is used to provide a highly compressed latent representation of the input volume, and a transformer is then used to estimate the likelihood of this compressed representation of the input. We demonstrate this approach can identify images that are both far-and near-out-of-distribution, as well as provide spatial maps that highlight the regions considered to be out-of-distribution. Furthermore, we find a strong relationship between an image's likelihood and the quality of a model's segmentation on it, demonstrating that this approach is viable for filtering out unsuitable images.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据