☆ 4.8 Article

Multi-scale YOLACT for instance segmentation

JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES (2022)

期刊

JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES

卷 34, 期 10, 页码 9419-9427

出版社

ELSEVIER

DOI: 10.1016/j.jksuci.2022.09.019

关键词

Instance segmentation; The response of prototype mask; Multi-scale features; Semantic information; Detailed information

类别

Computer Science, Information Systems

资金

National Natural Science Foundation of China [61763033, 61866028, 61663031, 61866025]
Jiangxi Provincial Key Program Project of Research and Development [20203BBGL73222]
Technology Innovation Guidance Program Project (Special Project of Technology Cooperation, Science and Technology Department of Jiangxi Province) [20212BDH81003]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper proposes a multi-scale instance segmentation method that improves network performance and segmentation accuracy by enhancing the response of prototype masks. The experimental results demonstrate that the method achieves higher segmentation accuracy while maintaining a relatively high speed.

The mainstream instance segmentation is a comprehensive computer vision task, which involves computer vision tasks such as image classification, object detection, and semantic segmentation. Aiming at the Prototype mask for initial segmentation mask with incorrect segmentation, this paper uses YOLACT (You Only Look at CoefficienTs) as the benchmark, in order to improve the network performance in the interfernce situation by enhancing the response of prototype mask, the multi-scale YOLACT for instance segmentation (MS YOLACT) is proposed, which increases the accuracy of segmentation by designing a lightweight network structure. First, the image gets multi-scale features through the residual network and the feature pyramid network. Then, the deep up-sampling and shallow down-sampling in the multi-scale feature layer are realized respectively to the size required by the prototype mask branch input, and all the deep information that has been up-sampled is further learned by convolution. Finally, at the input end of the prototype mask branch, the deep information and the shallow information are sequentially merged in an additive manner to improve the response of the prototype mask, thereby improving the accuracy of the target's mask segmentation. The experimental results show that compared with the benchmark on COCO test-dev, when the speed is reduced by only 1 FPS, the overall segmentation accuracy reaches an improvement of 0.6, and the segmentation accuracy of small and large targets reaches an improvement of 0.4 and 0.7 respectively; the visualization results also show that the segmentation mask of MS YOLACT is more accurate. In addition, MS YOLACT has the advantages of higher speed and lower requirements on equipment. (c) 2022 The Author(s). Published by Elsevier B.V. on behalf of King Saud University.

Multi-scale YOLACT for instance segmentation

期刊

JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Multi-scale YOLACT for instance segmentation

期刊

JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文