4.4 Article

A self-distillation object segmentation method via frequency domain knowledge augmentation

期刊

IET COMPUTER VISION
卷 17, 期 3, 页码 341-351

出版社

WILEY
DOI: 10.1049/cvi2.12170

关键词

computer vision; convolutional neural nets; image segmentation

向作者/读者索取更多资源

This paper proposes a self-distillation object segmentation method via frequency domain knowledge augmentation, which does not require complex auxiliary teacher structures and a large number of training samples. By constructing an object segmentation network that efficiently integrates multi-level features, a pixel-wise virtual teacher generation model is proposed to transfer pixel-wise knowledge to the object segmentation network through self-distillation learning, thereby improving its generalization ability. A frequency domain knowledge adaptive generation method is used to augment data, dynamically adjusting the learnable pixel-wise quantization table with a differentiable quantization operator. Experimental results show that the proposed method effectively enhances the performance of the object segmentation network, outperforming recent self-distillation methods, and achieving an average F-beta and mIoU increase of about 1.5% and 3.6% compared to a typical feature refinement self-distillation method.
Most self-distillation methods need complex auxiliary teacher structures and require lots of training samples in object segmentation task. To solve this challenging, a self-distillation object segmentation method via frequency domain knowledge augmentation is proposed. Firstly, an object segmentation network which efficiently integrates multi-level features is constructed. Secondly, a pixel-wise virtual teacher generation model is proposed to drive the transferring of pixel-wise knowledge to the object segmentation network through self-distillation learning, so as to improve its generalisation ability. Finally, a frequency domain knowledge adaptive generation method is proposed to augment data, which utilise differentiable quantisation operator to adjust the learnable pixel-wise quantisation table dynamically. What's more, we reveal convolutional neural network is more inclined to learn low-frequency information during the train. Experiments on five object segmentation datasets show that the proposed method can enhance the performance of the object segmentation network effectively. The boosting performance of our method is better than recent self-distillation methods, and the average F-beta and mIoU are increased by about 1.5% and 3.6% compared with typical feature refinement self-distillation method.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据