4.7 Article

Efficient Hybrid Supervision for Instance Segmentation in Aerial Images

期刊

REMOTE SENSING
卷 13, 期 2, 页码 -

出版社

MDPI
DOI: 10.3390/rs13020252

关键词

hybrid supervision; instance segmentation; aerial images; labeling cost

资金

  1. National Natural Science Foundation of China [61936011, 61871039, 62006101]
  2. Collaborative Innovation Center for Visual Intelligence [CYXC2011]

向作者/读者索取更多资源

The study proposes a pipeline of hybrid supervision, presenting an ancillary segmentation model to generate accurate pseudo pixel-wise labels for addressing challenges in aerial image instance segmentation. Experimental results show that the proposed method can achieve comparable performance on the iSAID dataset, outperforming weakly supervised settings.
Instance segmentation in aerial images is of great significance for remote sensing applications, and it is inherently more challenging because of cluttered background, extremely dense and small objects, and objects with arbitrary orientations. Besides, current mainstream CNN-based methods often suffer from the trade-off between labeling cost and performance. To address these problems, we present a pipeline of hybrid supervision. In the pipeline, we design an ancillary segmentation model with the bounding box attention module and bounding box filter module. It is able to generate accurate pseudo pixel-wise labels from real-world aerial images for training any instance segmentation models. Specifically, bounding box attention module can effectively suppress the noise in cluttered background and improve the capability of segmenting small objects. Bounding box filter module works as a filter which removes the false positives caused by cluttered background and densely distributed objects. Our ancillary segmentation model can locate object pixel-wisely instead of relying on horizontal bounding box prediction, which has better adaptability to arbitrary oriented objects. Furthermore, oriented bounding box labels are utilized for handling arbitrary oriented objects. Experiments on iSAID dataset show that the proposed method can achieve comparable performance (32.1 AP) to fully supervised methods (33.9 AP), which is obviously higher than weakly supervised setting (26.5 AP), when using only 10% pixel-wise labels.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据