4.6 Article

Active Mask-Box Scoring R-CNN for Sonar Image Instance Segmentation

期刊

ELECTRONICS
卷 11, 期 13, 页码 -

出版社

MDPI
DOI: 10.3390/electronics11132048

关键词

deep learning; active learning; instance segmentation; sonar image

资金

  1. National Natural Science Foundation of China [61871124, 61876037]
  2. fund of China ship development and design center [JJ-2021702-05]
  3. fund of national key Laboratory of science and technology on underwater acoustic antagonizing [2021-JCJQ-LB-033-09]

向作者/读者索取更多资源

This paper presents a novel method for instance segmentation of sonar images and embeds it in a deep active learning framework to address the mismatch between boxIoU and NMS score and the high annotation cost. The experimental results show significant improvements on the sonar image dataset, and the proposed method achieves competitive performance with fewer labeled samples compared to other frameworks.
Instance segmentation of sonar images is an effective method for underwater target recognition. However, the mismatch among positioning accuracy found by boxIoU and classification confidence, which is used as NMS score in current instance segmentation models; and the high annotation cost of sonar images, are two major problems in the task. To tackle these problems, in this paper, we present a novel instance segmentation method called Mask-Box Scoring R-CNN and embedded it in our proposed deep active learning framework. For the mismatch problem between boxIoU and NMS score, Mask-Box Scoring R-CNN uses a boxIoU head to predict the quality of the bounding boxes. We amend the non-maximum suppression (NMS) score predicted by BoxIoU to preserve high-quality bounding boxes in inference flow. To deal with the annotating problem, we propose a triplets-measure-based active learning (TBAL) method and a balanced-sampling method applicable for deep learning. The TBAL method evaluates the amount of information of unlabeled samples from the aspects of classification confidence, positioning accuracy, and mask quality. The balanced-sampling method selects hard samples from the dataset to train the model to improve performance. The experimental results show that Mask-Box Scoring R-CNN achieves improvements of 1% in boxAP and 1.3% boxAP on our sonar image dataset compared with Mask Scoring R-CNN and Mask R-CNN, respectively. The active learning framework with TBAL and balanced sampling can achieve a competitive performance with less labeled samples than other frameworks, which can better facilitate underwater target recognition.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据