☆ 4.4 Article

Domain adaptive crowd counting via dynamic scale aggregation network

IET COMPUTER VISION (2023)

期刊

IET COMPUTER VISION

卷 -, 期 -, 页码 -

出版社

WILEY

DOI: 10.1049/cvi2.12198

关键词

computer vision; image processing

类别

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Crowd counting is crucial in computer vision, aiming to estimate the number of people in an image. By regressing density maps, researchers have greatly improved the counting accuracy in recent years. However, due to domain shift, models trained on richly labeled datasets (source domain) do not perform well on datasets with limited labels (target domain). To address this issue, a novel dynamic scale aggregation network (DSANet) is proposed to bridge the gap in style and cross-domain head scale variations.

Crowd counting is an important research topic in computer vision. Its goal is to estimate the people's number in an image. Researchers have dramatically improved counting accuracy in recent years by regressing density maps. However, because of the inherent domain shift, the model trained on an expensive manually labelled dataset (source domain) does not perform well on a dataset with scarce labels (target domain). For this issue, a novel dynamic scale aggregation network (DSANet) is proposed to reduce the gaps in style and cross-domain head scale variations. Specifically, a practical style transfer layer is introduced to reduce the appearance discrepancy between the source and target domains. Then, the translated source and target domain samples are encoded by a generator consisting of the VGG16 network and the dynamic scale aggregation modules (DSA Modules) and produce corresponding density maps. The DSA module can adaptively adjust parameters according to the input features and effectively fuse multi-scale information to overcome the cross-domain head scale variations. Next, a discriminator judges the input density map from the source or target domain. Last, domain distributions are aligned through adversarial between the generator and the discriminator. The experiments show that our network outperforms the current state-of-the-art methods and can improve the target domain's performance while maintaining the source domain's performance without significant degradation.

Domain adaptive crowd counting via dynamic scale aggregation network

期刊

IET COMPUTER VISION

出版社

WILEY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Domain adaptive crowd counting via dynamic scale aggregation network

期刊

IET COMPUTER VISION

出版社

WILEY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文