4.7 Article

Extended Feature Pyramid Network for Small Object Detection

Journal

IEEE TRANSACTIONS ON MULTIMEDIA
Volume 24, Issue -, Pages 1968-1979

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TMM.2021.3074273

Keywords

Feature extraction; Object detection; Detectors; Semantics; Superresolution; Signal resolution; Pipelines; deep learning; feature pyramid; feature super-resolution; knowledge distillation; Small object detection

Funding

  1. National Natural Science Foundation of China [61836015]

Ask authors/readers for more resources

In this paper, the authors propose an extended feature pyramid network (EFPN) for small object detection. They introduce a feature texture transfer (FTT) module for super-resolving features and extracting regional details, as well as a cross resolution distillation mechanism to enhance the network's ability to perceive details. Experimental results show that the proposed EFPN is computationally and memory efficient, and achieves state-of-the-art results on small object detection datasets.
Small object detection remains an unsolved challenge because it is hard to extract the information of small objects with only a few pixels. While scale-level corresponding detection in feature pyramid network alleviates this problem, we find feature coupling of various scales still impairs the performance of small objects. In this paper, we propose an extended feature pyramid network (EFPN) with an extra high-resolution pyramid level specialized for small object detection. Specifically, we design a novel module, named feature texture transfer (FTT), which is used to super-resolve features and extract credible regional details simultaneously. Moreover, we introduce a cross resolution distillation mechanism to transfer the ability of perceiving details across the scales of the network, where a foreground-background-balanced loss function is designed to alleviate area imbalance of foreground and background. In our experiments, the proposed EFPN is efficient on both computation and memory, and yields state-of-the-art results on small traffic-sign dataset Tsinghua-Tencent 100 K and small category of general object detection dataset MS COCO.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available