4.7 Article

Learning scene-adaptive pseudo annotations for pedestrian detection in semi-supervised scenarios

期刊

KNOWLEDGE-BASED SYSTEMS
卷 243, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.knosys.2022.108439

关键词

Pedestrian detection; Semi-supervised learning; Domain adaptation; Collaborative training

资金

  1. National Natural Science Foundation China [62072189]
  2. Research Grants Council of the Hong Kong Special Administration Region [CityU 11201220]
  3. Natural Science Foundation of Guangdong Province, PR China [2020A1515010484]

向作者/读者索取更多资源

Sufficient labeled training data may not be available for pedestrian detection in many real-world scenes. In this paper, a Scene-adaptive Pseudo Annotation (SaPA) approach is proposed to improve the generalization performance and pseudo annotation quality for training a more precise and scene-specific pedestrian detector, by exploiting both source data with sufficient supervision and unannotated target data with domain-specific information.
Sufficient labeled training data may not be available for pedestrian detection in many real-world scenes. Semi-supervised settings naturally apply for the case where an adequate number of images are collected in a target scene but only a small proportion of them can be manually annotated. A common strategy is to adopt a detector trained on a well-established dataset (source data) or the limited annotated data to pseudo-annotate unannotated images. However, the domain gap and the lack of supervision in the target scene may lead to low-quality pseudo annotations. In this paper, we propose a Scene-adaptive Pseudo Annotation (SaPA) approach, which aims at exploiting two types of training data: source data providing sufficient supervision and unannotated target data offering domain-specific information. To utilize the source data, an Annotation Network (AnnNet) competes with a domain discriminator to learn domain-invariant features. To exploit the unannotated data, we temporally aggregate the parameters of AnnNet to build a more robust network, which is able to provide training goals for AnnNet. This new approach improves the generalization performance of AnnNet, which eventually leads to high-quality pseudo annotations to the unannotated data. Both manual and pseudo annotations are leveraged to train a more precise and scene-specific detector. We perform extensive experiments on multiple benchmarks to verify the effectiveness and superiority of SaPA.(c) 2022 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据