4.4 Article

Gaussian guided IoU: A better metric for balanced learning on object detection

Journal

IET COMPUTER VISION
Volume 16, Issue 6, Pages 556-566

Publisher

WILEY
DOI: 10.1049/cvi2.12113

This paper presents an anchor-based detection method that uses Gaussian Guided IoU for target assignment and proposes corresponding balanced learning methods to address the issues of insufficient supervision for slender objects and inaccurate feature alignment.
Most anchor-based detectors use intersection over union (IoU) to assign targets to anchors during training. However, IoU does not pay enough attention to the proximity of the anchor's centre to the centre of the truth box, which causes two issues: (1) most slender objects are assigned just one anchor, leaving insufficient supervision information for slender objects during training; (2) IoU cannot accurately represent the degree of alignment between the feature's receptive field at the anchor's centre and the object. As a result, some well-aligned features are missed while poorly aligned ones are used, reducing the model's localisation accuracy. To address these issues, we first design a Gaussian Guided IoU (GGIoU), which prioritises the proximity of the anchor's centre to the truth box's centre. We then propose GGIoU-balanced learning methods, comprising a GGIoU-guided assignment strategy and a GGIoU-balanced localisation loss. These methods can assign multiple anchors to each slender object and favour features that are well aligned with the objects during training. Extensive experiments show that GGIoU-balanced learning addresses the aforementioned problems and significantly improves the detection model's performance.
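
The abstract does not give the GGIoU formula, so the following is only a minimal sketch of the general idea: an overlap score that also rewards an anchor whose centre lies close to the truth box's centre. The function names, the Gaussian form, the normalisation by the truth box's width and height, and the parameters `alpha` and `sigma` are all illustrative assumptions, not the paper's definition.

```python
import math

def iou(a, b):
    """Plain IoU of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def centre_gaussian(anchor, gt, sigma=0.5):
    """Gaussian weight in (0, 1]: 1 when the centres coincide, smaller as they
    move apart. Offsets are normalised by the truth box's width and height
    (an assumption); gt is assumed to have positive width and height."""
    ax, ay = (anchor[0] + anchor[2]) / 2, (anchor[1] + anchor[3]) / 2
    gx, gy = (gt[0] + gt[2]) / 2, (gt[1] + gt[3]) / 2
    dx = (ax - gx) / (gt[2] - gt[0])
    dy = (ay - gy) / (gt[3] - gt[1])
    return math.exp(-(dx * dx + dy * dy) / (2 * sigma * sigma))

def centre_weighted_iou(anchor, gt, alpha=0.5, sigma=0.5):
    """Hypothetical combination: geometric blend of IoU and the centre weight.
    alpha = 0 gives plain IoU; larger alpha puts more weight on centre proximity."""
    return iou(anchor, gt) ** (1 - alpha) * centre_gaussian(anchor, gt, sigma) ** alpha

# A slender truth box and two square anchors with identical IoU.
gt = (0, 0, 100, 10)
centred_anchor = (40, -5, 60, 15)   # centre coincides with the truth box centre
offset_anchor = (0, -5, 20, 15)     # same overlap area, but centre far to the left

same_iou = iou(centred_anchor, gt) == iou(offset_anchor, gt)
centred_score = centre_weighted_iou(centred_anchor, gt)
offset_score = centre_weighted_iou(offset_anchor, gt)
```

In this example plain IoU cannot distinguish the two anchors, while the centre-weighted score ranks the centred anchor higher, which is the behaviour the abstract attributes to GGIoU.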
