4.6 Article

SRNet: Scale-Aware Representation Learning Network for Dense Crowd Counting

期刊

IEEE ACCESS
卷 9, 期 -, 页码 136032-136044

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2021.3115963

关键词

Feature extraction; Task analysis; Convolution; Estimation; Decoding; Semantics; Location awareness; Dense crowd counting; multi-scale feature learning; deep learning; convolution neural network

资金

  1. Natural Science Foundation of Shanghai [19ZR1455300]
  2. National Natural Science Foundation of China [61806126]

向作者/读者索取更多资源

This paper introduces a scale-aware representation learning network (SRNet) to address scale variation issues in dense crowd counting and crowd localization tasks. By encoding and decoding images and using two modules for multi-scale feature learning and spatial resolution enhancement, the SRNet has been proven effective in both qualitative and quantitative experiments.
Huge variations in the scales of people in images create an extremely challenging problem in the task of crowd counting. Currently, many researchers apply multi-column structures to solve the scale variation problem. However, multi-column structures usually have complex structures with large numbers of parameters and are difficult to optimize. To this end, we propose a scale-aware representation learning network (SRNet) that uses a commonly used encoder-decoder framework. An image is converted into deep features by the first ten layers of VGG16 in the encoder. Then, the features are regressed to a crowd density map via the decoder. The decoder mainly consists of two modules: the scale-aware feature learning module (SAM) and the pixel-aware upsampling module (PAM). SAM models the multi-scale features of a crowd at each level with different sizes of receptive fields, and PAM enlarges the spatial resolution and enhances the pixel-level semantic information, thereby improving the overall counting accuracy. We conduct extensive crowd counting experiments on ShanghaiTech Part_A, UCF-QNRF, and UCF_CC_50 datasets. Furthermore, to obtain the locations of each person, we conduct crowd localization experiments on UCF-QNRF and NWPU-Crowd datasets. The qualitative and quantitative results prove the effectiveness of the SRNet in dense crowd counting and crowd localization tasks.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据