A multi-scale and multi-level feature aggregation network for crowd counting

NEUROCOMPUTING (2021)

Journal

NEUROCOMPUTING

Volume 423, Pages 46-56

Publisher

ELSEVIER

DOI: 10.1016/j.neucom.2020.09.059

Keywords

Crowd counting; Multi-scale; Multi-level; Normalized Euclidean loss

Categories

Computer Science, Artificial Intelligence

Funding

National Natural Science Foundation of China [11872069]

Abstract

In this paper, a multi-scale and multi-level feature aggregation network (MFANet) is proposed for accurate and efficient crowd counting. By introducing the scale and level aggregation module (SLAM) and the normalized Euclidean loss (NEL), the network achieves state-of-the-art performance in crowd counting and localization, as demonstrated in extensive experiments on benchmark datasets.

Recently, crowd counting has drawn widespread attention in computer vision, but it is extremely challenging because of varying scales and densities. Many existing methods focus on improving the multi-scale representation by utilizing multi-column or multi-branch architectures with different kernel sizes. However, such networks cannot extract feature maps with large receptive fields because of their limited depth. In addition, the importance of utilizing multi-level feature information in a deep network is ignored. In this paper, we propose a multi-scale and multi-level feature aggregation network (MFANet) for accurate and efficient crowd counting, which can be trained end-to-end. A vital component of the network is the scale and level aggregation module (SLAM), which extracts multi-scale features and makes full use of multi-level feature information for more accurate estimation. Our method achieves its best performance when six SLAMs are stacked together in the network. Furthermore, we introduce a new loss function called normalized Euclidean loss (NEL) to balance the contribution of all samples to network training. To demonstrate the performance of the proposed method, extensive experiments are conducted on four benchmark crowd counting datasets: ShanghaiTech Part A/B, UCF-CC-50, Mall, and UCF-QNRF. Experimental results show that our MFANet achieves state-of-the-art performance in crowd counting and crowd localization. © 2020 Elsevier B.V. All rights reserved.
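
The abstract says that NEL balances the contribution of all samples to training but does not give its formula. The sketch below is therefore not the paper's definition. It only illustrates the general idea under an assumed normalization: each image's squared Euclidean error between the predicted and ground-truth density maps is divided by the ground-truth count plus a constant `eps`. The function names, the `eps` parameter, and the choice of normalizer are all hypothetical.

```python
from typing import Sequence

# A density map is a 2-D grid of non-negative values whose sum is the crowd count.
DensityMap = Sequence[Sequence[float]]


def crowd_count(density: DensityMap) -> float:
    """Return the crowd count, i.e. the sum over the density map."""
    return sum(sum(row) for row in density)


def squared_error(pred: DensityMap, gt: DensityMap) -> float:
    """Squared Euclidean distance between two density maps of equal shape."""
    return sum(
        (p - g) ** 2
        for pred_row, gt_row in zip(pred, gt)
        for p, g in zip(pred_row, gt_row)
    )


def plain_euclidean_loss(preds: Sequence[DensityMap], gts: Sequence[DensityMap]) -> float:
    """Standard (unnormalized) Euclidean loss averaged over the batch."""
    return sum(squared_error(p, g) for p, g in zip(preds, gts)) / (2 * len(preds))


def normalized_euclidean_loss(
    preds: Sequence[DensityMap], gts: Sequence[DensityMap], eps: float = 1.0
) -> float:
    """ASSUMED normalization (not taken from the paper): divide each sample's
    squared error by its ground-truth count plus eps, then average."""
    total = 0.0
    for p, g in zip(preds, gts):
        total += squared_error(p, g) / (crowd_count(g) + eps)
    return total / len(preds)


if __name__ == "__main__":
    sparse_gt = [[1.0, 0.0], [0.0, 0.0]]       # 1 person
    sparse_pred = [[0.5, 0.0], [0.0, 0.0]]
    dense_gt = [[10.0, 10.0], [10.0, 10.0]]    # 40 people
    dense_pred = [[5.0, 10.0], [10.0, 10.0]]
    preds, gts = [sparse_pred, dense_pred], [sparse_gt, dense_gt]
    print(plain_euclidean_loss(preds, gts))
    print(normalized_euclidean_loss(preds, gts))
```

In the unnormalized loss the dense image contributes a squared error of 25 against 0.25 for the sparse one, so it dominates the batch. Under the assumed normalization the two terms become 25/41 and 0.125, which are much closer in scale.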
