☆ 4.7 Article

Object detectors involving a NAS-gate convolutional module and capsule attention module

SCIENTIFIC REPORTS (2022)

期刊

SCIENTIFIC REPORTS

卷 12, 期 1, 页码 -

出版社

NATURE PORTFOLIO

DOI: 10.1038/s41598-022-07898-7

关键词

类别

Multidisciplinary Sciences

资金

National R&D Project of Development of automatic screening and hybrid detection system for hazardous material detecting in port container [20200611]
Technology development Program of MSS [S3146559]
National Research Foundation of Korea (NRF) - Korean government (MSIT) [NRF-2020R1A4A1016619]
Korea Medical Device Development Fund - Korea government (Ministry of Science and ICT) [KMDF_PR_20200901_0016, 9991006689]
Korea Medical Device Development Fund - Korea government (Ministry of Trade, Industry and Energy) [KMDF_PR_20200901_0016, 9991006689]
Korea Medical Device Development Fund - Korea government (Ministry of Health Welfare) [KMDF_PR_20200901_0016, 9991006689]
Korea Medical Device Development Fund - Korea government (Ministry of Food and Drug Safety) [KMDF_PR_20200901_0016, 9991006689]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper improves the performance of object detectors by modifying the backbone architecture and feature pyramid using Neural Architecture Search (NAS) and Capsule Network. The NAS-gate convolutional module deals with object scale variation, while the Capsule Attention module optimizes feature representation and localization capability. Results show that NASGC-CapANet outperforms the baseline models on multiple datasets.

Several state-of-the-art object detectors have demonstrated outstanding performances by optimizing feature representation through modification of the backbone architecture and exploitation of a feature pyramid. To determine the effectiveness of this approach, we explore the modification of object detectors' backbone and feature pyramid by utilizing Neural Architecture Search (NAS) and Capsule Network. We introduce two modules, namely, NAS-gate convolutional module and Capsule Attention module. The NAS-gate convolutional module optimizes standard convolution in a backbone network based on differentiable architecture search cooperation with multiple convolution conditions to overcome object scale variation problems. The Capsule Attention module exploits the strong spatial relationship encoding ability of the capsule network to generate a spatial attention mask, which emphasizes important features and suppresses unnecessary features in the feature pyramid, in order to optimize the feature representation and localization capability of the detectors. Experimental results indicate that the NAS-gate convolutional module can alleviate the object scale variation problem and the Capsule Attention network can help to avoid inaccurate localization. Next, we introduce NASGC-CapANet, which incorporates the two modules, i.e., a NAS-gate convolutional module and capsule attention module. Results of comparisons against state-of-the-art object detectors on the MS COCO val-2017 dataset demonstrate that NASGC-CapANet-based Faster R-CNN significantly outperforms the baseline Faster R-CNN with a ResNet-50 backbone and a ResNet-101 backbone by mAPs of 2.7% and 2.0%, respectively. Furthermore, the NASGC-CapANet-based Cascade R-CNN achieves a box mAP of 43.8% on the MS COCO test-dev dataset.

Object detectors involving a NAS-gate convolutional module and capsule attention module

期刊

SCIENTIFIC REPORTS

出版社

NATURE PORTFOLIO

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Object detectors involving a NAS-gate convolutional module and capsule attention module

期刊

SCIENTIFIC REPORTS

出版社

NATURE PORTFOLIO

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文