4.7 Article

Object detection/tracking toward underwater photographs by remotely operated vehicles (ROVs)

出版社

ELSEVIER
DOI: 10.1016/j.future.2021.07.011

关键词

Object detection; Underwater; Remotely operated vehicles; YOLO; Segmentation

资金

  1. Natural Science Foundation of Heilongjiang Province [LH2021E045]

向作者/读者索取更多资源

This paper proposes an underwater target detection method based on the weighted YOLO network, combined with an adaptive dimensional clustering algorithm and speckle detection technology, achieving accurate detection of small underwater targets. The combination of background subtraction and three-frame difference effectively improves the detection of moving object pixels.
Owing to the shortcoming of image blur, scale diversity, and complex backgrounds, underwater target detection and tracking is a tough challenge in real-world ROVs applications. In this paper, an underwater target detection method based on the class-weighted YOLO network is proposed. The key technique is to construct the class-weighted loss function based on the deep network YOLO to balance the sample difficulty and achieve better results. In addition, an adaptive dimensional clustering algorithm of target box is introduced to further improve the detection performance. Firstly, according to the imaging characteristics of the small underwater target in the side-scan sonar image, the sonar image is filtered and segmented by clustering to significantly reduce noises in each image. Afterward, the speckle detection is deployed to extract the suspected target area in the side-scan sonar image. Finally, the sonar image is segmented according to the automatic threshold segmentation and receive the binary image of the target region. The target scale is estimated by the second moment and the false target is eliminated. Correspondingly, the accurate detection of small underwater target can be achieved. The moving object pixel is detected by background difference and three-frame difference respectively. On this basis, we combine the background difference and the three-frame difference. The background difference characterizes the objects capturing the incomplete message detected by the three-frame difference. Finally, morphological operation is leveraged to eliminate the noise caused by non-static background objects. Experiments on multiple ROVs have validated that, the proposed algorithm can improve the average accuracy and the recall rate remarkably in the dense target detection task that contains 20 targets averagely. (C) 2021 Published by Elsevier B.V.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据