4.6 Article

Object Detection for Industrial Applications: Training Strategies for AI-Based Depalletizer

期刊

APPLIED SCIENCES-BASEL
卷 12, 期 22, 页码 -

出版社

MDPI
DOI: 10.3390/app122211581

关键词

machine learning; deep learning; CNN; artificial intelligence; robotics; object detection; industrial depalletization

资金

  1. Polytechnic University of Bari
  2. Comau(R) S.p.A. company

向作者/读者索取更多资源

The demand for robot-based depalletization systems has been increasing in recent years due to the growth of logistics, storage, and supply chain sectors. Classical depalletization systems are being replaced by innovative solutions based on 2D/3D vision and deep learning methods to handle unstructured scenarios. This study compares different training strategies to customize an object detection model and validates the effectiveness of fine-tuning a pre-trained CNN model for this task.
In the last 10 years, the demand for robot-based depalletization systems has constantly increased due to the growth of sectors such as logistics, storage, and supply chains. Since the scenarios are becoming more and more unstructured, characterized by unknown pallet layouts and stockkeeping unit shapes, the classical depalletization systems based on the knowledge of predefined positions within the pallet frame are going to be substituted by innovative and robust solutions based on 2D/3D vision and Deep Learning (DL) methods. In particular, the Convolutional Neural Networks (CNNs) are deep networks that have proven to be effective in processing 2D/3D images, for example in the automatic object detection task, and robust to the possible variability among the data. However, deep neural networks need a big amount of data to be trained. In this context, whenever deep networks are involved in object detection for supporting depalletization systems, the dataset collection represents one of the main bottlenecks during the commissioning phase. The present work aims at comparing different training strategies to customize an object detection model aiming at minimizing the number of images required for model fitting, while ensuring reliable and robust performances. Different approaches based on a CNN for object detection are proposed, evaluated, and compared in terms of the Fl-score. The study was conducted considering different starting conditions in terms of the neural network's weights, the datasets, and the training set sizes. The proposed approaches were evaluated on the detection of different kinds of paper boxes placed on an industrial pallet. The outcome of the work validates that the best strategy is based on fine-tuning of a CNN-based model already trained on the detection of paper boxes, with a median F1-score greater than 85.0%.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据