4.6 Article

Feature-Aware Adaptation and Density Alignment for Crowd Counting in Video Surveillance

期刊

IEEE TRANSACTIONS ON CYBERNETICS
卷 51, 期 10, 页码 4822-4833

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCYB.2020.3034316

关键词

Feature extraction; Training; Task analysis; Adaptation models; Data models; Labeling; Gallium nitride; Crowd counting; denisty estimation; unsupervised domain adaptation

资金

  1. National Natural Science Foundation of China [U1864204, 61773316, 61632018, 61825603]

向作者/读者索取更多资源

With the advancement of deep neural networks, the performance of crowd counting and pixel-wise density estimation has been improving, but challenges persist. A synthetic crowd dataset released recently helps address the difficulty in collecting data, but the domain gap between real data and synthetic images hinders model performance. To tackle this, a domain-adaptation-style crowd counting method is proposed in this article, which effectively adapts the model from synthetic data to specific real-world scenes.
With the development of deep neural networks, the performance of crowd counting and pixel-wise density estimation is continually being refreshed. Despite this, there are still two challenging problems in this field: 1) current supervised learning needs a large amount of training data, but collecting and annotating them is difficult and 2) existing methods cannot generalize well to the unseen domain. A recently released synthetic crowd dataset alleviates these two problems. However, the domain gap between the real-world data and synthetic images decreases the models' performance. To reduce the gap, in this article, we propose a domain-adaptation-style crowd counting method, which can effectively adapt the model from synthetic data to the specific real-world scenes. It consists of multilevel feature-aware adaptation (MFA) and structured density map alignment (SDA). To be specific, MFA boosts the model to extract domain-invariant features from multiple layers. SDA guarantees the network outputs fine density maps with a reasonable distribution on the real domain. Finally, we evaluate the proposed method on four mainstream surveillance crowd datasets, Shanghai Tech Part B, WorldExpo'10, Mall, and UCSD. Extensive experiments are evidence that our approach outperforms the state-of-the-art methods for the same cross-domain counting problem.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据