4.7 Article

Fusion of Mask RCNN and attention mechanism for instance segmentation of apples under complex background

期刊

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.compag.2022.106864

关键词

Apple; Target segmentation; Deep learning; Mask RCNN; Attention mechanism

资金

  1. Talent Introduction Program of Xian University of Science and Technology [2050121002]
  2. Natural Science Basic Research Program of Shaanxi [2022JQ-186]

向作者/读者索取更多资源

This study developed a precise apple instance segmentation method based on an improved Mask RCNN, which achieved accurate apple segmentation under various conditions and demonstrated near real-time performance. The method outperformed other comparison methods and laid the foundation for accurate fruit detection and long-term automatic growth monitoring.
It is important to precisely segment apples in an orchard during the growth period to obtain accurate growth information. However, the complex environmental factors and growth characteristics, such as fluctuating illumination, overlapping and occlusion of apples, the gradual change in the ground colour of apples from green to red, and the similarities between immature apples and background leaves, affect apple segmentation accuracy. The purpose of this study was to develop a precise apple instance segmentation method based on an improved Mask region-based convolutional neural network (Mask RCNN). An existing Mask RCNN model was improved by fusing an attention module into the backbone network to enhance its feature extraction ability. A combination of deformable convolution and the transformer attention with the key content only term was used as the attention module in this study. The experimental results showed that the improved Mask RCNN can accurately segment apples under various conditions, such as apples with shadows and different ground colours, overlapped apples, and apples occluded by branches and leaves. A recall, precision, F1 score, and segmentation mAP of 97.1%, 95.8%, 96.4% and 0.917, respectively, were achieved, and the average run-time on the test set was 0.25 s per image. Our method outperformed the two methods in comparison, indicating that it can accurately segment apples in the growth stage with a near real-time performance. This study lays the foundation for realizing accurate fruit detection and long-term automatic growth monitoring.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据