4.6 Article

Adaptive spatial pixel-level feature fusion network for multispectral pedestrian detection

期刊

INFRARED PHYSICS & TECHNOLOGY
卷 116, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.infrared.2021.103770

关键词

Adaptive fusion; Pixel-level feature fusion; Multispectral pedestrian detection; Spatial attention module; Pixel attention module

向作者/读者索取更多资源

An adaptive spatial pixel-level feature fusion network is proposed to effectively fuse information from visible and thermal infrared images, improving pedestrian detection performance. Experimental results demonstrate that the method achieves a good balance between detection speed and accuracy on the KAIST dataset.
A pedestrian detector that uses visible and thermal infrared image pairs as the input has better detection performance than a detector that uses only visible image under challenging illumination conditions. With the aim to efficiently and effectively fuse complementary information from visible and thermal infrared images, this paper proposes an adaptive spatial pixel-level feature fusion network called the ASPFF Net, which can adaptively extract spatial pixel-level features from visible and infrared images for fusion. Specifically, first, two light networks with different weights are used to extract multi-scale features of visible and infrared images. Next, for features of the same scale but different modalities, the fusion weights of different spatial positions and pixels in the two feature maps are obtained by the spatial attention module (SAM) and pixel attention module (PAM). The original features of visible and infrared images are recalibrated by the fusion weights, and multi-scale fused feature layers are obtained. Finally, different scales of pedestrians are detected on the fused multi-scale feature layers. Compared with the other recent multispectral pedestrian detectors on the reasonable subset of the KAIST multispectral pedestrian detection dataset, the proposed detector is attractive in balancing speed and accuracy. The extensive experiments on the KAIST dataset demonstrate the effectiveness of the proposed method for the fusion of visible and infrared image in multispectral pedestrian detection.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据