☆ 4.5 Article

MGSNet: A multi-scale and gated spatial attention network for crowd counting

APPLIED INTELLIGENCE (2022)

期刊

APPLIED INTELLIGENCE

卷 52, 期 13, 页码 15436-15446

出版社

SPRINGER

DOI: 10.1007/s10489-022-03263-3

关键词

Crowd counting; Density estimation; Spatial attention; Gated; Multi-scale

类别

Computer Science, Artificial Intelligence

资金

National Natural Science Foundation of China [61971073]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper proposes a multi-scale and gated spatial attention network (MGSNet) for crowd counting, which aims to address the challenges of large-scale variation and complex background noise. Experimental results demonstrate its effectiveness over state-of-the-art approaches on multiple datasets.

Recently, crowd counting via estimating a density map has been widely studied. However, it still has a variety of issues to overcome, such as large-scale variation of population, complex background noise, perspective distortion, etc. The large-scale variation of heads will restrict the performance of crowd counting approaches, and the complex background noise will result in the background, such as leaf and mesh, being incorrectly recognized as heads. To maintain large-scale variation and generate a high-quality estimated density map, we propose a novel multi-scale fusion scale-aware attention network called multi-scale and gated spatial attention network (MGSNet). In MGSNet, the first 10 layers of VGG16 with Batch Normalization (BN) are utilized as backbone. Then, two branches, i.e., a large-scale branch and a scale-aware attention branch, are followed. The large-scale branch is used to overcome the large-scale variation of heads in crowd images, in which a Scale Information Aggregation Block (SIAB) is employed to extract multi-scale features by utilizing dilated convolution with different receptive fields. The scale-aware attention branch is used to address complex background noise in crowd scenes, in which a Gated Spatial Attention Block (GSAB) inspired by the Long Short-term Memory Networks (LSTM) is employed to fuse the previous information with different scales and retain the appropriate scale information of crowds. We demonstrate our proposed method on the ShanghaiTech (Part AB), UCF-CC-50 and UCF-QNRF datasets. The experimental results show its effectiveness over the state-of-the-art.

MGSNet: A multi-scale and gated spatial attention network for crowd counting

期刊

APPLIED INTELLIGENCE

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

MGSNet: A multi-scale and gated spatial attention network for crowd counting

期刊

APPLIED INTELLIGENCE

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文