4.7 Article

Cross Position Aggregation Network for Few-Shot Strip Steel Surface Defect Segmentation

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TIM.2023.3246519

关键词

Steel; Feature extraction; Strips; Semantic segmentation; Aggregates; Training; Surface morphology; Cross-position aggregation network (CPANet); few-shot learning; few-shot semantic segmentation (FSS); strip steel surface defect ((SD)-D-3) segmentation

向作者/读者索取更多资源

This article proposes a simple and effective few-shot segmentation method called CPANet, which aims to learn a network that can segment untrained S3D categories with only a few labeled defective samples. CPANet effectively aggregates long-range relationships of discrete defects using CPP and SA modules. It also introduces an SSA module to aggregate multiscale context information of defect features and suppresses interference from background information. Extensive experiments demonstrate that CPANet achieves state-of-the-art performance on the FSSD-12 dataset.
Strip steel surface defect (S3D) segmentation is a crucial method to inspect the surface quality of strip steel in the producing-and-manufacturing. However, existing S3D semantic segmentation methods depend on quite a few labeled defective samples for training, and generalization to novel defect categories that have not yet been trained is challenging. Additionally, some defect categories are incredibly sparse in the industrial production processes. Motivated by the above problems, this article proposed a simple but effective few-shot segmentation method named cross position aggregation network (CPANet), which intends to learn a network that can segment untrained S3D categories with only a few labeled defective samples. Using a cross-position proxy (CPP) module, our CPANet can effectively aggregate long-range relationships of discrete defects, and support auxiliary (SA) can further improve the feature aggregation capability of CPP. Moreover, CPANet introduces a space squeeze attention (SSA) module to aggregate multiscale context information of defect features and suppresses disadvantageous interference from background information. In addition, a novel S3D few-shot semantic segmentation (FSS) dataset FSSD-12 is proposed to evaluate our CPANet. Through extensive comparison experiments and ablation experiments, we explicitly evaluate that our CPANet with the ResNet-50 backbone achieves state-of-theart performance on dataset FSSD-12. Our dataset and code are available at (https://github.com/VDT-2048/CPANet).

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据