4.7 Article

Remote Sensing Image Target Detection: Improvement of the YOLOv3 Model with Auxiliary Networks

期刊

REMOTE SENSING
卷 13, 期 19, 页码 -

出版社

MDPI
DOI: 10.3390/rs13193908

关键词

YOLOv3; Distance-IoU; convolutional block attention module; adaptive feature fusion; DOTA dataset

资金

  1. National Natural Science Foundation China [61601174]
  2. Postdoctoral Research Foundation of Heilongjiang Province [LBH-Q17150]
  3. Science and Technology Innovative Research Team in Higher Educational Institutions of Heilongjiang Province [2012TD007]
  4. Fundamental Research Funds for the Heilongjiang Provincial Universities [KJCXZD201703]
  5. Science Foundation of Heilongjiang Province of China [F2018026]

向作者/读者索取更多资源

The paper aims to improve the performance of the YOLO algorithm in remote sensing image target detection by further enhancing the YOLOv3 model, achieving improved detection speed and accuracy. The experimental results demonstrate that the optimized network model outperforms the original YOLOv3 model with the auxiliary network, with a higher mAP and increased detection frame rate.
Remote sensing image target detection is widely used for both civil and military purposes. However, two factors need to be considered for remote sensing image target detection: real-time and accuracy for detecting targets that occupy few pixels. Considering the two above issues, the main research objective of this paper is to improve the performance of the YOLO algorithm in remote sensing image target detection. The reason is that the YOLO models can guarantee both detection speed and accuracy. More specifically, the YOLOv3 model with an auxiliary network is further improved in this paper. Our model improvement consists of four main components. Firstly, an image blocking module is used to feed fixed size images to the YOLOv3 network; secondly, to speed up the training of YOLOv3, DIoU is used, which can speed up the convergence and increase the training speed; thirdly, the Convolutional Block Attention Module (CBAM) is used to connect the auxiliary network to the backbone network, making it easier for the network to notice specific features so that some key information is not easily lost during the training of the network; and finally, the adaptive feature fusion (ASFF) method is applied to our network model with the aim of improving the detection speed by reducing the inference overhead. The experiments on the DOTA dataset were conducted to validate the effectiveness of our model on the DOTA dataset. Our model can achieve satisfactory detection performance on remote sensing images, and our model performs significantly better than the unimproved YOLOv3 model with an auxiliary network. The experimental results show that the mAP of the optimised network model is 5.36% higher than that of the original YOLOv3 model with the auxiliary network, and the detection frame rate was also increased by 3.07 FPS.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据