4.5 Article

Visual Inspection Method for Metal Rolls Based on Multi-Scale Spatial Location Feature

期刊

SYMMETRY-BASEL
卷 14, 期 7, 页码 -

出版社

MDPI
DOI: 10.3390/sym14071291

关键词

warehouse management; deep learning; object detection; attention mechanisms; feature fusion

资金

  1. National Key Research and Development Program of Chinese Intelligent Robot [2018YFB1309000]
  2. National Natural Science Foundation of China [61973320]
  3. Liaoning Province State Key Laboratory of Robotics [2021KF2218]
  4. Youth Program of National Natural Science Foundation of China [61903138]
  5. Postgraduate Scientific Research Innovation Project of Hunan Province [QL20210048]
  6. Fundamental Research Funds for the Central Universities of Central South University

向作者/读者索取更多资源

This study proposes an efficient attention mechanism algorithm and a nonlinear feature fusion module for metal roll detection. A multi-scale object detection network is developed to accurately detect metal rolls. Experimental results demonstrate the effectiveness of the proposed network.
Metal rolls in a non-ferrous-metal manufacturing workshop manifest the characteristics of symmetry, multiple scales and mutual covering, which poses great challenges for metal roll detection. To solve this problem, firstly, an efficient attention mechanism algorithm named ECLAM (efficient capture location attendant model) is proposed for capturing spatial position features efficiently, to obtain complete location information for metal rolls in a complex environment. ECLAM can improve the ability to extract the spatial features of backbone networks and reduce the influence of the non-critical background. In addition, in order to give feature maps a larger receptive field and improve the weight of location information in multi-scale feature maps, a nonlinear feature fusion module named LFFM (location feature fusion module) is used to fuse two adjacent feature images. Finally, a multi-scale object detection network named L-MSNet (location-based multi-scale object detection network) based on the combination of ECLAM and LFFM is proposed and used to accurately detect multi-scale metal rolls. In the experiments, multi-scale metal roll images are collected from an actual non-ferrous-metal manufacturing workshop. On this basis, a pixel-level image dataset is constructed. Comparative experiments show that, compared with other object detection methods, L-MSNet can detect multi-scale metal rolls more accurately. The average accuracy is improved by 2% to 5%, and the average accuracy of small and medium-sized objects is also significantly improved by 3% to 6%.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据