4.5 Article

SC2Net: Scale-aware Crowd Counting Network with Pyramid Dilated Convolution

期刊

APPLIED INTELLIGENCE
卷 53, 期 5, 页码 5146-5159

出版社

SPRINGER
DOI: 10.1007/s10489-022-03648-4

关键词

Crowd counting; Crowd localization; Multi-scale feature learning; Residual network; Pyramid convolution

向作者/读者索取更多资源

This paper proposes an efficient scale-aware crowd counting network called SC2Net, which adopts an encoder-decoder framework and residual pyramid dilated convolution modules to extract multi-scale information and regress predicted density maps. Experimental results demonstrate the superiority of our proposed method over other state-of-the-art methods.
Accurate crowd counting is still challenging due to the variations of crowd heads. Most of crowd counting methods adopt multi-branch networks to extract multi-scale information. However, these networks are too complex to be optimized. To solve these problems, we propose an efficient scale-aware crowd counting network named SC2Net, which adopts the encoder-decoder framework. The encoder uses the first ten layers of VGG16 to extract the primary feature information. The decoder is mainly consisted of our proposed residual pyramid dilated convolution (ResPyDConv) modules to regress predicted density maps. Specifically, the ResPyDConv module is composed of pyramid dilated convolution (PyDConv). Each PyDConv adopts dilated convolutions with different dilated rates. PyDConv divides feature maps into different groups and extracts multi-scale feature information. Extensive experiments are conducted on ShanghaiTech, UCF_CC_50, UCF_QNRF, and NWPU_Crowd datasets. Qualitative and quantitive results show the superiority of our proposed network to the other state-of-the-art methods.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据