4.4 Article

Domain adaptive crowd counting via dynamic scale aggregation network

期刊

IET COMPUTER VISION
卷 -, 期 -, 页码 -

出版社

WILEY
DOI: 10.1049/cvi2.12198

关键词

computer vision; image processing

向作者/读者索取更多资源

Crowd counting is crucial in computer vision, aiming to estimate the number of people in an image. By regressing density maps, researchers have greatly improved the counting accuracy in recent years. However, due to domain shift, models trained on richly labeled datasets (source domain) do not perform well on datasets with limited labels (target domain). To address this issue, a novel dynamic scale aggregation network (DSANet) is proposed to bridge the gap in style and cross-domain head scale variations.
Crowd counting is an important research topic in computer vision. Its goal is to estimate the people's number in an image. Researchers have dramatically improved counting accuracy in recent years by regressing density maps. However, because of the inherent domain shift, the model trained on an expensive manually labelled dataset (source domain) does not perform well on a dataset with scarce labels (target domain). For this issue, a novel dynamic scale aggregation network (DSANet) is proposed to reduce the gaps in style and cross-domain head scale variations. Specifically, a practical style transfer layer is introduced to reduce the appearance discrepancy between the source and target domains. Then, the translated source and target domain samples are encoded by a generator consisting of the VGG16 network and the dynamic scale aggregation modules (DSA Modules) and produce corresponding density maps. The DSA module can adaptively adjust parameters according to the input features and effectively fuse multi-scale information to overcome the cross-domain head scale variations. Next, a discriminator judges the input density map from the source or target domain. Last, domain distributions are aligned through adversarial between the generator and the discriminator. The experiments show that our network outperforms the current state-of-the-art methods and can improve the target domain's performance while maintaining the source domain's performance without significant degradation.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据