Article

Weighted boxes fusion: Ensembling boxes from different object detection models

IMAGE AND VISION COMPUTING (2021)

Journal

IMAGE AND VISION COMPUTING

Volume 107, Issue -, Pages -

Publisher

ELSEVIER

DOI: 10.1016/j.imavis.2021.104117

Keywords

Object detection; Computer vision; Deep learning

Categories

Computer Science, Artificial Intelligence; Computer Science, Software Engineering; Computer Science, Theory & Methods; Engineering, Electrical & Electronic; Optics

Summary

This study introduces weighted boxes fusion, a method for combining predictions from different object detection models that significantly improves the quality of the fused predicted rectangles for an ensemble. The method helped to achieve top results in the Open Images and COCO Object Detection challenges, and the 3D version of boxes fusion was applied by the winning teams of the Waymo Open Dataset and Lyft 3D Object Detection for Autonomous Vehicles challenges.

Abstract

Object detection is a crucial task in computer vision systems, with a wide range of applications in autonomous driving, medical imaging, retail, security, face recognition, robotics, and other areas. Today, neural-network-based models are used to localize and classify instances of objects of particular classes. When real-time inference is not required, ensembles of models help to achieve better results. In this work, we present a novel method for fusing predictions from different object detection models: weighted boxes fusion. Our algorithm uses the confidence scores of all proposed bounding boxes to construct averaged boxes. The method significantly improves the quality of the fused predicted rectangles for an ensemble. We tested the method on several datasets and evaluated it in the context of the Open Images and COCO Object Detection challenges, where it helped to achieve top results. The 3D version of boxes fusion was successfully applied by the winning teams of the Waymo Open Dataset and Lyft 3D Object Detection for Autonomous Vehicles challenges. The source code is publicly available on GitHub (Solovyev, 2019 [31]). (c) 2021 Published by Elsevier B.V.
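The core idea in the abstract, using confidence scores to build averaged boxes, can be illustrated with a small sketch. This is not the authors' published implementation. It is a simplified, single-class illustration under stated assumptions: boxes are `(x1, y1, x2, y2)` tuples, boxes are grouped greedily by IoU against the current fused box, the fused score is the plain mean of the member scores, and the names `iou`, `fuse_cluster`, `weighted_fusion_sketch`, and the default `iou_thr=0.55` are all hypothetical choices made for this example.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def fuse_cluster(members):
    """Confidence-weighted average of the (box, score) pairs in one cluster."""
    total = sum(score for _, score in members)
    box = tuple(
        sum(b[i] * score for b, score in members) / total for i in range(4)
    )
    # Assumption of this sketch: the fused score is the mean member score.
    return box, total / len(members)


def weighted_fusion_sketch(boxes, scores, iou_thr=0.55):
    """Greedily cluster boxes pooled from several models and average each cluster."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    clusters = []  # members of each cluster, as lists of (box, score)
    fused = []     # current fused (box, score) of each cluster
    for i in order:
        box, score = tuple(boxes[i]), scores[i]
        if score <= 0:
            continue  # zero-weight boxes cannot contribute to the average
        best, best_iou = None, iou_thr
        for k, (fused_box, _) in enumerate(fused):
            overlap = iou(box, fused_box)
            if overlap > best_iou:
                best, best_iou = k, overlap
        if best is None:
            # No sufficiently overlapping cluster: start a new one.
            clusters.append([(box, score)])
            fused.append((box, score))
        else:
            # Add to the best-matching cluster and recompute its fused box.
            clusters[best].append((box, score))
            fused[best] = fuse_cluster(clusters[best])
    return fused


# Two overlapping predictions of one object plus one separate object.
boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (50, 50, 60, 60)]
scores = [0.9, 0.6, 0.8]
result = weighted_fusion_sketch(boxes, scores)
```

Unlike suppression-style approaches that keep a single box per cluster and discard the rest, this kind of averaging lets every overlapping prediction pull the fused coordinates toward itself in proportion to its confidence. In the example, the two overlapping boxes merge into one box that lies closer to the higher-scoring prediction, while the isolated box passes through unchanged.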
