4.7 Article

Learning to Segment From Scribbles Using Multi-Scale Adversarial Attention Gates

期刊

IEEE TRANSACTIONS ON MEDICAL IMAGING
卷 40, 期 8, 页码 1990-2001

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TMI.2021.3069634

关键词

Image segmentation; Annotations; Shape; Training; Logic gates; Semantics; Gallium nitride; Weak supervision; scribbles; segmentation; GAN; attention; shape Priors

资金

  1. Erasmus+ Programme of the European Union

向作者/读者索取更多资源

This study explores image segmentation in medical imaging using scribble annotations, with the use of multi-scale GAN for generating segmentation masks and a novel attention gating mechanism for improved object localization. Experimental results demonstrate that the model performs comparably to fully annotated models on medical and non-medical datasets, with potential extensions to various settings.
Large, fine-grained image segmentation datasets, annotated at pixel-level, are difficult to obtain, particularly in medical imaging, where annotations also require expert knowledge. Weakly-supervised learning can train models by relying on weaker forms of annotation, such as scribbles. Here, we learn to segment using scribble annotations in an adversarial game. With unpaired segmentation masks, we train a multi-scale GAN to generate realistic segmentation masks at multiple resolutions, while we use scribbles to learn their correct position in the image. Central to the model's success is a novel attention gating mechanism, which we condition with adversarial signals to act as a shape prior, resulting in better object localization at multiple scales. Subject to adversarial conditioning, the segmentor learns attention maps that are semantic, suppress the noisy activations outside the objects, and reduce the vanishing gradient problem in the deeper layers of the segmentor. We evaluated our model on several medical (ACDC, LVSC, CHAOS) and non-medical (PPSS) datasets, and we report performance levels matching those achieved by models trained with fully annotated segmentation masks. We also demonstrate extensions in a variety of settings: semi-supervised learning; combining multiple scribble sources (a crowdsourcing scenario) and multi-task learning (combining scribble and mask supervision). We release expert-made scribble annotations for the ACDC dataset, and the code used for the experiments, at https://vios-s.github.io/multiscale-adversarial-attention-gates.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据