Article

Few-Shot Segmentation via Divide-and-Conquer Proxies

INTERNATIONAL JOURNAL OF COMPUTER VISION (2023)

Journal

INTERNATIONAL JOURNAL OF COMPUTER VISION

Volume -, Issue -, Pages -

Publisher

SPRINGER

DOI: 10.1007/s11263-023-01886-8

Keywords

Few-Shot learning; Few-Shot segmentation; Semantic segmentation; Prototype learning

Category

Computer Science, Artificial Intelligence

Summary

Few-shot segmentation (FSS) is a challenging task that aims to identify unseen classes with only a few annotated samples. Current approaches based on prototype learning fail to fully utilize support image-mask pairs, leading to segmentation failures. To address this, the authors propose a flexible framework that divides the segmentation mask, generates support-induced proxies, and incorporates parallel decoding and semantic consistency regularization.

Abstract

Few-shot segmentation (FSS) is a challenging and still under-explored task that aims to identify objects of unseen classes from only a handful of densely annotated samples. Most current FSS approaches perform meta-inference within the prototype learning paradigm, which fails to fully exploit the information in support image-mask pairs and leads to several kinds of segmentation failure, such as incomplete objects, ambiguous boundaries, and distractor activation. To address these failures, we develop a flexible and generic framework in the spirit of divide-and-conquer. We first apply a novel self-reasoning scheme to the labeled support image and then divide the resulting coarse segmentation mask into several regions with different properties. Using masked average pooling, a series of support-induced proxies is generated on the fly, each playing a specific role in overcoming the challenges above. Furthermore, we design a parallel decoder structure and a semantic consistency regularization to eliminate confusion and enhance discrimination. In contrast to conventional prototype-based approaches, the proposed divide-and-conquer proxies (DCP) provide episode-level guidance that goes well beyond the object cues themselves. Extensive experiments on FSS benchmarks, covering both standard and cross-domain settings, verify the effectiveness of the approach. In addition, we propose a temporal DCP and extend it to video object segmentation via a memory repository and progressive propagation, which illustrates the scalability of the framework. The source code is available at https://github.com/chunbolang/DCP.
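The abstract's core step, dividing a coarse support mask into regions and pooling one proxy per region, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation (see the linked repository for that). The four-way partition obtained by comparing a coarse prediction with the ground-truth mask, and the names `divide_regions`, `build_proxies`, `fg_hit`, `fg_missed`, `bg_false`, and `bg_correct`, are assumptions made here for illustration only.

```python
import numpy as np


def masked_average_pooling(features, mask, eps=1e-6):
    """Average a (C, H, W) feature map over the pixels where mask is 1.

    Returns a length-C vector. An empty mask yields a zero vector,
    because eps keeps the denominator away from zero.
    """
    mask = mask.astype(features.dtype)
    weighted_sum = (features * mask[None, :, :]).sum(axis=(1, 2))
    return weighted_sum / (mask.sum() + eps)


def divide_regions(coarse_pred, gt_mask):
    """Hypothetical partition: compare a coarse prediction on the
    support image with its ground-truth mask (both of shape (H, W))."""
    pred = coarse_pred.astype(bool)
    gt = gt_mask.astype(bool)
    return {
        "fg_hit": pred & gt,         # object pixels the coarse pass found
        "fg_missed": ~pred & gt,     # object pixels the coarse pass missed
        "bg_false": pred & ~gt,      # background wrongly activated
        "bg_correct": ~pred & ~gt,   # background correctly rejected
    }


def build_proxies(features, coarse_pred, gt_mask):
    """Pool one proxy vector per region of the support feature map."""
    regions = divide_regions(coarse_pred, gt_mask)
    return {name: masked_average_pooling(features, region)
            for name, region in regions.items()}
```

In this sketch the four regions are disjoint and together cover the image, so each proxy summarizes a different kind of pixel: regions the coarse pass handled correctly and regions where it failed. In a real model the feature map would come from a backbone network and the coarse prediction from the self-reasoning step. Both are plain arrays here.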
