Article

D-NMS: A dynamic NMS network for general object detection

NEUROCOMPUTING (2022)

Journal

NEUROCOMPUTING

Volume 512, Pages 225-234

Publisher

ELSEVIER

DOI: 10.1016/j.neucom.2022.09.080

Keywords

Non-maximum suppression; Scene complexity; Dynamic threshold; Object detection

Funding

National Natural Science Foundation of China [91848111]

Automated Summary

This paper proposes a dynamic NMS network (D-NMS net) that predicts the optimal NMS threshold for each input image and can be embedded into object detectors. Experimental results demonstrate that, with the help of D-NMS net, the accuracy and efficiency of detectors are significantly improved.

Abstract

Non-maximum Suppression (NMS), which is used to find the optimal inferences among all candidate bounding boxes, is a significant post-processing step in most state-of-the-art object detectors. The fixed-threshold scheme in standard NMS treats every input image identically and therefore ignores the characteristics unique to each image. Recently, several adaptive NMS methods have been proposed and demonstrated to be superior to standard NMS with a fixed threshold. However, the adaptability of these methods is limited because they lack a measure of the complexity of the input image. In this paper, we propose a dynamic NMS network (D-NMS net) to predict the best NMS threshold for each input image, which can be embedded into most state-of-the-art single-stage object detectors. Concretely, we first propose a unified scene complexity definition for a single image according to the relationship between the P-R curve and the changing NMS threshold. Secondly, we calculate the optimal NMS threshold for each image according to the proposed definition, which is then applied as the supervision label in the training stage. Lastly, we embed the lightweight regression network, D-NMS net, into mainstream object detectors. Extensive experiments are conducted on challenging datasets. With the help of our D-NMS net, both the accuracy and the efficiency of detectors improve. On Pascal VOC, the mean Average Precision (mAP) of RetinaNet is boosted from 81.60% to 84.74%, and the mAP of FCOS is improved from 79.12% to 84.20%. On MS-COCO, the Average Precision (AP) of RetinaNet is boosted from 36.4% to 38.5%, and the AP of FCOS is improved from 37.2% to 39.1%. Meanwhile, the inference speed of our method is increased by up to 62%. © 2022 Elsevier B.V. All rights reserved.
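To make the idea of a per-image NMS threshold concrete, the following is a minimal Python sketch, not the authors' implementation. It shows standard greedy NMS with an explicit threshold and a simple sweep that picks, for one image, the threshold giving the best detection quality. The quality measure used here (F1 at a matching IoU of 0.5), the candidate threshold grid, and all function names are assumptions made for illustration; the paper instead derives its label from a scene complexity definition based on the P-R curve, which the abstract does not specify in detail.

```python
from typing import List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def nms(boxes: Sequence[Box], scores: Sequence[float], threshold: float) -> List[int]:
    """Greedy NMS: visit boxes by descending score and keep a box only if
    its IoU with every already-kept box does not exceed the threshold."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        if all(iou(boxes[i], boxes[k]) <= threshold for k in keep):
            keep.append(i)
    return keep


def f1_at_iou(kept: Sequence[Box], gt: Sequence[Box], match_iou: float = 0.5) -> float:
    """Hypothetical quality measure: F1 with greedy one-to-one matching."""
    unmatched = list(gt)
    tp = 0
    for box in kept:
        best = max(unmatched, key=lambda g: iou(box, g), default=None)
        if best is not None and iou(box, best) >= match_iou:
            unmatched.remove(best)
            tp += 1
    if tp == 0:
        return 0.0
    precision, recall = tp / len(kept), tp / len(gt)
    return 2 * precision * recall / (precision + recall)


def best_threshold(boxes, scores, gt, candidates=(0.3, 0.4, 0.5, 0.6, 0.7)) -> float:
    """Sweep candidate thresholds; return the first one with the highest F1.
    Such a value could serve as a per-image regression label."""
    def quality(t: float) -> float:
        return f1_at_iou([boxes[i] for i in nms(boxes, scores, t)], gt)
    return max(candidates, key=quality)


# Crowded image: two true objects overlap (IoU about 0.43), plus one duplicate.
crowded_gt = [(0, 0, 10, 10), (4, 0, 14, 10)]
crowded_boxes = [(0, 0, 10, 10), (4, 0, 14, 10), (0.5, 0, 10.5, 10)]
crowded_scores = [0.9, 0.8, 0.7]

# Sparse image: one true object and one loosely localized duplicate.
sparse_gt = [(0, 0, 10, 10)]
sparse_boxes = [(0, 0, 10, 10), (2, 0, 12, 10)]
sparse_scores = [0.9, 0.6]

print(best_threshold(crowded_boxes, crowded_scores, crowded_gt))  # 0.5
print(best_threshold(sparse_boxes, sparse_scores, sparse_gt))     # 0.3
```

In this toy setup the crowded image needs a higher threshold so that the second, genuinely distinct object survives suppression, while the sparse image is best served by a low threshold that removes the duplicate. This is the kind of image-dependent behavior that a fixed threshold cannot capture and that a regression network such as D-NMS net is trained to predict from the image itself.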
