Article

Lightweight multi-scale attention-guided network for real-time semantic segmentation

Journal

IMAGE AND VISION COMPUTING
Volume 139

Publisher

ELSEVIER
DOI: 10.1016/j.imavis.2023.104823

Keywords

Lightweight network; Attention mechanism; Multi-scale feature fusion; Real-time semantic segmentation


This paper proposes LMANet, a lightweight multi-scale attention-guided network for real-time semantic segmentation built on an asymmetric encoder-decoder structure. The network introduces multi-scale asymmetric residual (MAR) modules, an attention feature fusion (AFF) module, and an attention pyramid refining (APR) module to enhance feature expression and improve segmentation accuracy while maintaining reasonable inference speed and parameter count.
The wide adoption of small mobile devices has made lightweight real-time semantic segmentation an increasingly pressing demand and one of the most popular research topics in computer vision. However, some current methods blindly pursue low parameter counts and high inference speed, resulting in excessively low accuracy and a loss of practical value. Therefore, this paper proposes a lightweight multi-scale attention-guided network for real-time semantic segmentation (LMANet) based on an asymmetric encoder-decoder structure to resolve this dilemma. In the encoder, we propose multi-scale asymmetric residual (MAR) modules that extract local spatial information and contextual information to enhance feature expression. In the decoder, we design an attention feature fusion (AFF) module and an attention pyramid refining (APR) module. The AFF module uses high-level semantic information to guide the fusion of low-level and middle-level features, and the APR module then refines the fusion result. In addition, attention modules throughout the network further improve segmentation performance. Our network is evaluated on two complex urban road datasets. The experimental results show that LMANet achieves 70.6% mIoU on Cityscapes and 66.5% mIoU on CamVid at 112 FPS and 333 FPS respectively, with only 0.95 M parameters and without any pre-training or pre-processing. Compared with most existing state-of-the-art models, our network not only maintains reasonable inference speed and parameter count but also pushes accuracy as high as possible, making it more practical.
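The abstract describes the MAR module only at a high level. As a rough illustration of this kind of design (a minimal PyTorch sketch under stated assumptions, not the authors' implementation), an MAR-style block might combine factorized asymmetric convolutions with parallel dilation rates:

```python
import torch
import torch.nn as nn


class MARBlock(nn.Module):
    """Sketch of a multi-scale asymmetric residual (MAR) block.

    Assumptions (not stated in the abstract): the 'asymmetric'
    convolutions are factorized 3x1/1x3 pairs, and multi-scale
    context comes from parallel branches with different dilations.
    """

    def __init__(self, channels: int, dilations=(1, 2)):
        super().__init__()
        mid = channels // len(dilations)
        self.reduce = nn.Conv2d(channels, mid, kernel_size=1, bias=False)
        self.branches = nn.ModuleList([
            nn.Sequential(
                # factorized (asymmetric) 3x3 conv: 3x1 followed by 1x3
                nn.Conv2d(mid, mid, kernel_size=(3, 1),
                          padding=(d, 0), dilation=(d, 1), bias=False),
                nn.Conv2d(mid, mid, kernel_size=(1, 3),
                          padding=(0, d), dilation=(1, d), bias=False),
                nn.BatchNorm2d(mid),
                nn.ReLU(inplace=True),
            )
            for d in dilations
        ])
        self.expand = nn.Conv2d(mid * len(dilations), channels,
                                kernel_size=1, bias=False)
        self.bn = nn.BatchNorm2d(channels)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        y = self.reduce(x)
        y = torch.cat([branch(y) for branch in self.branches], dim=1)
        y = self.bn(self.expand(y))
        return self.relu(x + y)  # residual connection keeps gradients flowing


if __name__ == "__main__":
    x = torch.randn(1, 64, 128, 256)
    print(MARBlock(64)(x).shape)  # torch.Size([1, 64, 128, 256])
```

Factorizing a 3×3 convolution into 3×1 and 1×3 halves cuts its parameter count from roughly 9C² to 6C², which is the usual motivation for asymmetric convolutions in lightweight networks.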
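Similarly, the AFF module is described only as using high-level semantics to guide the fusion of low-level and middle-level features. A minimal sketch, assuming a common channel-attention gating pattern (the specific gating design here is hypothetical):

```python
import torch
import torch.nn as nn


class AFFBlock(nn.Module):
    """Sketch of an attention feature fusion (AFF) module.

    Assumption: high-level semantics gate the fusion via a channel
    attention vector (global pooling -> 1x1 conv -> sigmoid).
    Inputs are assumed to be projected to the same channel count,
    with `low` and `mid` at the same spatial resolution.
    """

    def __init__(self, channels: int):
        super().__init__()
        self.gate = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),       # squeeze spatial dims
            nn.Conv2d(channels, channels, 1),
            nn.Sigmoid(),                  # per-channel weights in (0, 1)
        )
        self.fuse = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1, bias=False),
            nn.BatchNorm2d(channels),
            nn.ReLU(inplace=True),
        )

    def forward(self, low, mid, high):
        w = self.gate(high)                # (N, C, 1, 1) attention vector
        fused = low * w + mid * (1.0 - w)  # semantics decide the mix
        return self.fuse(fused)


if __name__ == "__main__":
    low = torch.randn(1, 64, 64, 128)
    mid = torch.randn(1, 64, 64, 128)
    high = torch.randn(1, 64, 16, 32)
    print(AFFBlock(64)(low, mid, high).shape)  # torch.Size([1, 64, 64, 128])
```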
