4.5 Article

Regularizing deep networks with label geometry for accurate object localization on small training datasets

Journal

PATTERN RECOGNITION LETTERS
Volume 154, Issue -, Pages 53-59

Publisher

ELSEVIER
DOI: 10.1016/j.patrec.2022.01.004

Keywords

Object detection; Object localization; Label geometry; Box evolution; Small dataset; Human-machine interaction

Funding

  1. National Key R&D Program of China [2021YFF0602101]
  2. National Science Founda-tion of China [NSFC 61906194]

Ask authors/readers for more resources

Localization is a critical subtask in object detection, and this work proposes a method to improve localization performance on small datasets by fully exploiting limited annotations. By extracting label geometry and generating distance transform, the method reconstructs object geometry through pixel-wise supervision, and enhances geometric-aware features through coupled training with regression.
Localization is a critical subtask in object detection, which is closely related to spatial information of objects. Most current detectors simply rely on the fitting ability of deep neural networks to regress towards numerical targets such as coordinates of object boxes. Training deep networks for sufficient fitting ability requires a large number of annotations that are expensive to obtain. In this work, we fully exploit limited annotations by extracting label geometry to improve localization performance on small datasets. We generate distance transform of bounding box edges according to localization labels, with which we supervise intermediate outputs of networks pixel by pixel to reconstruct object geometry for localization. Distance transform is sensitive to box edges and provides geometric gradients flowing into boundaries. We learn such gradients to enhance geometric-aware features through a coupled training with regression, and use it to refine regressed boxes in an evolutionary manner in inference. Extensive experiments are implemented to demonstrate the effectiveness of our method. Our method can be applied in applications that required human-machine interaction, such as the driver-assistance system in autonomous driving, by providing accurate detections to assist humans in making better decisions.(c) 2022 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available