4.7 Article

Hierarchical pyramid attentive network with spatial separable convolution for crowd counting

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.engappai.2021.104563

关键词

Crowd counting; Scale-aware attentive context; Hierarchical feature fusion; Scale variation

资金

  1. National Natural Science Foundation of China [62173290]
  2. Central Government Guided Local Funds for Science and Technology Development, China [216Z0301G]
  3. Natural Science Foundation of Hebei province in China [F2019203285, F2019203526]
  4. Department of Education of Hebei Province, China [ZD2020118]
  5. Qinhuangdao Science and Technology Bureau, China [201902A215]
  6. Postgraduate Innovation Fund Project of Hebei Province, China [CXZZBS2021126]

向作者/读者索取更多资源

This paper introduces a novel method based on HPANet to address the challenging issue of scale variation in crowd counting tasks, improving counting accuracy. By designing SPA blocks and an HFF module, HPANet provides a powerful scale-aware feature representation and demonstrates effective performance on public benchmark datasets.
To tackle the challenging scale variation issue of the crowd counting task so as to improve the counting accuracy, we present a novel method based on Hierarchical Pyramid Attentive Network (HPANet) for crowd counting. Specifically, a Scale-aware Pyramid Attentive (SPA) block, extracting the rich multi-scale context, is designed elaborately as using the two-branch spatial separable convolution as its core component to replace the conventional pure convolution with larger kernel size to reduce the computation, as well as adopting a self-attention operation for the spatial feature aggregation. In order to further learn the scale-aware feature representation well from the input image, we stack the designed SPA block in a hierarchical way and fuse their features flexibly as another crucial module of the proposed HPANet, the Hierarchical Feature Fusion (HFF) module. Combining the designed SPA block and HFF module, the developed HPANet could remedy the scale variation issue and thus improve the counting performance with the mighty scale-aware feature representation. The performance of the HPANet is evaluated on four public available benchmark datasets in this paper, including ShanghaiTech, Mall, Beijing BRT and UCF-QNRF. Extensive experimental results on benchmarks demonstrate that the proposed HPANet could have an effective performance for crowd counting and the ablation experimental results validate the effectiveness of the components of HPANet on the counting task. The designed HPANet could realize a preferable counting performance in view of alleviating the scale variation issue, without the cost of introducing too much additional parameters for the multi-column structure.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据