4.5 Article

Recognize highly similar sewing gestures by the robot

期刊

出版社

SAGE PUBLICATIONS LTD
DOI: 10.1177/15589250221077267

关键词

Sewing gestures; target detection; feature extraction; behavior recognition

资金

  1. Natural Science Foundation of China [51905405]
  2. Key Research and Development plan of Shaanxi province China [2019ZDLGY01-08]
  3. Ministr of Education Engineering Science and Technology Talent Training Research Project of China [18JDGC029]
  4. Innovation Capability Support Program of Shaanxi [2021TD-29]
  5. Key Research and Development program of Shaanxi province [2022GY-276]

向作者/读者索取更多资源

The paper proposes three deep learning-based models for sewing gesture detection, which improve accuracy and speed by enhancing network structure and feature extraction methods. The performance of each model is compared and analyzed.
The autonomous and efficient learning of sewing gestures by robots will bring great convenience to the garment industry. To improve the accuracy of robots in detecting sewing gestures with high similarity, three detection models based on deep learning are proposed in the paper. First, in order to improve the detection accuracy and detection speed of sewing gestures under complex backgrounds, we added a dense connection layer to the low-resolution network layer of YOLO-V3 to enhance the transmission and reuse rate of image features. Secondly, a deeper ResNet50 residual network is introduced to replace the VGG16 basic network in the original SSD model. The feature pyramid structure is used to fuse high-level semantic features and low-level semantic features, which can improve the detection accuracy of small-sized sewing gestures. Finally, the parallel spatial-temporal dual-stream network separately extracts the temporal feature and the spatial feature of sewing gestures. The fusion of time feature and space feature improves the detection accuracy of the coherent sewing gesture. The results show that the suggested three models can effectively detect four sewing gestures with high similarity. Among them, the spatial-temporal two-stream convolutional neural network has the highest detection accuracy. The improved SSD model has faster detection speed than the improved YOLO-V3 model and other mainstream algorithms.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据