4.7 Article

Learning scene-adaptive pseudo annotations for pedestrian detection in semi-supervised scenarios

Journal

KNOWLEDGE-BASED SYSTEMS
Volume 243, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.knosys.2022.108439

Keywords

Pedestrian detection; Semi-supervised learning; Domain adaptation; Collaborative training

Funding

  1. National Natural Science Foundation China [62072189]
  2. Research Grants Council of the Hong Kong Special Administration Region [CityU 11201220]
  3. Natural Science Foundation of Guangdong Province, PR China [2020A1515010484]

Ask authors/readers for more resources

Sufficient labeled training data may not be available for pedestrian detection in many real-world scenes. In this paper, a Scene-adaptive Pseudo Annotation (SaPA) approach is proposed to improve the generalization performance and pseudo annotation quality for training a more precise and scene-specific pedestrian detector, by exploiting both source data with sufficient supervision and unannotated target data with domain-specific information.
Sufficient labeled training data may not be available for pedestrian detection in many real-world scenes. Semi-supervised settings naturally apply for the case where an adequate number of images are collected in a target scene but only a small proportion of them can be manually annotated. A common strategy is to adopt a detector trained on a well-established dataset (source data) or the limited annotated data to pseudo-annotate unannotated images. However, the domain gap and the lack of supervision in the target scene may lead to low-quality pseudo annotations. In this paper, we propose a Scene-adaptive Pseudo Annotation (SaPA) approach, which aims at exploiting two types of training data: source data providing sufficient supervision and unannotated target data offering domain-specific information. To utilize the source data, an Annotation Network (AnnNet) competes with a domain discriminator to learn domain-invariant features. To exploit the unannotated data, we temporally aggregate the parameters of AnnNet to build a more robust network, which is able to provide training goals for AnnNet. This new approach improves the generalization performance of AnnNet, which eventually leads to high-quality pseudo annotations to the unannotated data. Both manual and pseudo annotations are leveraged to train a more precise and scene-specific detector. We perform extensive experiments on multiple benchmarks to verify the effectiveness and superiority of SaPA.(c) 2022 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available