4.5 Article

YOLOv3-MT: A YOLOv3 using multi-target tracking for vehicle visual detection

期刊

APPLIED INTELLIGENCE
卷 52, 期 2, 页码 2070-2091

出版社

SPRINGER
DOI: 10.1007/s10489-021-02491-3

关键词

Object detection; Class-wise k-means clustering; Occluded environment; YOLOv3; Kalman filter

资金

  1. National Natural Science Foundation of China
  2. Central University basic scientific research business fee project of Civil Aviation University of China [3122018C001]

向作者/读者索取更多资源

This study presents a method based on YOLOv3, utilizing AS-CBAM and HDC to improve detection accuracy and achieve real-time warnings for completely occluded objects. It introduces DIOU-NMS and a tracking algorithm based on Kalman filtering and Hungarian matching to enhance the system's ability to recognize occluded objects. Compared to traditional YOLOv3, the proposed method increases mAP and shows a faster processing speed and significant improvement in accuracy on datasets focusing on occlusion conditions.
During automatic driving, the complex background and mutual occlusion between multiple targets hinder the correct judgment of the detector and miss detection. When a close-range target is captured again, the vehicle may not be able to respond in time and cause a fatal accident. Therefore, in the application of auxiliary systems, a model that can accurately identify partially occluded targets in complex backgrounds and perform short-term tracking and early warning of completely occluded objects is required. This paper proposes a method to improve detection accuracy while supporting real-time operations based on YOLOv3 and realize real-time warnings for those objects that are completely blocked. First, we obtain a more suitable prior frames setting through class-wise K-means clustering. To solve the problem that the maxpool operation of original CBAM easily introduces background noise, we proposed AS-CBAM(Adaptive Selection Convolutional Block Attention Module) and innovatively combined the HDC(Hybrid Dilated Convolution) to maximize the receptive field and fine-tune the characteristics. The 1x1 convolution operation is used to suppress the increase of the parameter amount. In this study, DIOU-NMS was used to replace traditional NMS. Besides, a tracking algorithm based on Kalman filtering and Hungarian matching is introduced to improve the system's ability to recognize occluded objects. Compared with the traditional YOLOv3, the proposed method can increase the mAP by 1.32% and 1.47% on KITTI and UA-DETRAC, respectively. Nevertheless, it shows a processing speed of 35.07FPS and a more significant improvement in accuracy (90.36% vs. 85.71%) on the Object-Mask, a dataset that focuses on occlusion conditions. Therefore, the proposed algorithm is more suitable for autonomous driving applications.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据