4.7 Article

Lightweight tomato real-time detection method based on improved YOLO and mobile deployment

期刊

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.compag.2023.107625

关键词

YOLO; MobileNetV3; Tomato detection; Model pruning; Model Quantification

向作者/读者索取更多资源

In this study, a lightweight improved YOLOv5 based algorithm was proposed for real-time localization and ripeness detection of tomato fruits. The algorithm reconstructed the backbone network of YOLOv5 using a down-sampling convolutional layer and the bneck module of MobileNetV3, and further reduced the model size by performing channel pruning for the neck layer. Experimental results showed significant improvements in terms of parameter compression, model size reduction, and detection speed on the CPU platform.
The current deep-learning-based tomato target detection algorithm has many parameters; it has drawbacks of large computation, long time consumption, and reliance on high-computing-power devices such as graphics processing units (GPU). In this study, we propose a lightweight improved YOLOv5 (You Only Look Once) based algorithm to achieve real-time localization and ripeness detection of tomato fruits. Initially, this algorithm used a down-sampling convolutional layer instead of the original focus layer, reconstructing the backbone network of YOLOv5 using the bneck module of MobileNetV3. Then, it performs channel pruning for the neck layer to further reduce the model size and uses a genetic algorithm for hyperparameter optimization to improve detection ac-curacy. We evaluate the improved algorithm using a homemade tomato dataset. The experimental results demonstrated that the improved model number of parameters and floating point operations per second (FLOPs) were compressed by 78% and 84.15% compared to the original YOLOv5s, while the mAP reached 0.969. Meanwhile, the detection speed on the central processing unit (CPU) platform was 42.5 ms, which was 64.88% better. This study further utilized the Nihui convolutional neural network (NCNN) framework to quantize the improved model and developed an Android-based real-time tomato monitoring application (app). Experimental results demonstrated that the 16-bit quantized model achieved an average detection frame rate of 26.5 frames per second (fps) on the mobile side with lower arithmetic power, which was 268% better than the original YOLOv5s, and the model size was reduced by 51.1% while achieving a 93% true detection rate.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据