4.5 Article

FPANet: Feature pyramid aggregation network for real-time semantic segmentation

期刊

APPLIED INTELLIGENCE
卷 52, 期 3, 页码 3319-3336

出版社

SPRINGER
DOI: 10.1007/s10489-021-02603-z

关键词

Real-time; Feature pyramid network; Atrous spatial pyramid pooling; feature fusion; Border refinement

资金

  1. National Natural Science Foundation of China [61662009, 61772008]
  2. Key Program of the National Natural Science Union Foundation of China [U1836205]

向作者/读者索取更多资源

This study presents the FPANet model, which uses the novel SeBiFPN and lightweight feature pyramid fusion module, and addresses border segmentation issues through BRM, achieving high-quality real-time semantic segmentation with a better balance of speed and accuracy compared to state-of-the-art methods.
Semantic segmentation is used in many fields, and most fields not only require models with high-quality predictions but also require real-time speed in the forward inference phase. Therefore, our goal is to perform high-quality real-time semantic segmentation, thus proposing the feature pyramid aggregation network (FPANet). This network can be regarded as an encoder-decoder model. In the encoder stage, we use ResNet and atrous spatial pyramid pooling (ASPP) to extract more high-level semantic information. In the decoder stage, to simultaneously obtain the semantic and spatial information of the image, we propose a bilateral directional feature pyramid network for semantic segmentation to fuse features at different levels, it is named SeBiFPN. In SeBiFPN, we design a lightweight feature pyramid fusion module (FPFM) to fuse features from two different levels. In addition, when predicting the border region of an image, most real-time semantic segmentation models perform poorly; therefore, we propose a border refinement module (BRM) to improve the problem of inaccurate border segmentation. To reduce the computational complexity of the model, we redesign the ASPP module and reduce the number of feature channels during feature fusion. Our method achieves a better balance of speed and accuracy compared to the state-of-the-art methods on the Cityscapes and CamVid datasets.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据