4.7 Article

Locality guided cross-modal feature aggregation and pixel-level fusion for multispectral pedestrian detection

Journal

INFORMATION FUSION
Volume 88, Issue -, Pages 1-11

Publisher

ELSEVIER
DOI: 10.1016/j.inffus.2022.06.008

Keywords

Multispectral fusion; Pedestrian detection; Deep neural networks; Feature aggregation; Pixel-wise guidance

Funding

  1. National Natural Science Foundation of China [52075485]

This paper proposes a novel multispectral pedestrian detection method that generates highly discriminative features by aggregating human-related clues in multispectral images. By performing cross-modal feature aggregation and pixel-level detection fusion, the proposed method achieves improved accuracy in pedestrian detection.
Multispectral pedestrian detection has received much attention in recent years due to its superiority in detecting targets under adverse lighting and weather conditions. In this paper, we aim to generate highly discriminative multi-modal features by aggregating the human-related clues from all available samples present in multispectral images. To this end, we present a novel multispectral pedestrian detector that performs locality guided cross-modal feature aggregation and pixel-level detection fusion. Given a number of single bounding boxes covering pedestrians in both modalities, we deploy two segmentation sub-branches to predict the existence of pedestrians in the visible and thermal channels. By referring to the important locality information in the reference modality, we perform locality guided cross-modal feature aggregation to learn highly discriminative human-related features in the complementary modality, exploring the clues of all available pedestrians. Moreover, we use the obtained spatial locality maps to provide prediction confidence scores in the visible and thermal channels and conduct pixel-wise adaptive fusion of the detection results from the complementary modalities. Extensive experiments demonstrate the effectiveness of the proposed method, which outperforms current state-of-the-art detectors on both the KAIST and CVC-14 multispectral pedestrian detection datasets.
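
The abstract does not give the fusion formula, so the following is only an illustrative sketch of what pixel-wise adaptive fusion could look like, not the authors' implementation. It assumes that each modality produces a per-pixel detection score map and a locality map with values in [0, 1], and that the locality maps are normalised into per-pixel weights. The function name `fuse_pixelwise`, the `eps` smoothing term, and the normalisation rule are all hypothetical choices made for this example.

```python
from typing import List

Map = List[List[float]]  # a 2-D grid of per-pixel values


def fuse_pixelwise(score_vis: Map, score_thr: Map,
                   loc_vis: Map, loc_thr: Map,
                   eps: float = 1e-6) -> Map:
    """Blend visible and thermal score maps pixel by pixel.

    Assumption (not from the paper): each locality map acts as the
    per-pixel confidence of its modality, and the two confidences are
    normalised so that the weights at every pixel sum to 1.
    """
    fused: Map = []
    for sv_row, st_row, lv_row, lt_row in zip(score_vis, score_thr,
                                              loc_vis, loc_thr):
        row = []
        for sv, st, lv, lt in zip(sv_row, st_row, lv_row, lt_row):
            # eps keeps the weights defined when both confidences are 0,
            # in which case the two modalities are simply averaged.
            w_vis = (lv + eps) / (lv + lt + 2 * eps)
            w_thr = 1.0 - w_vis
            row.append(w_vis * sv + w_thr * st)
        fused.append(row)
    return fused


# Toy 1x2 example: the visible channel is trusted at the first pixel,
# the thermal channel at the second.
example = fuse_pixelwise(
    score_vis=[[0.9, 0.2]], score_thr=[[0.1, 0.8]],
    loc_vis=[[1.0, 0.0]], loc_thr=[[0.0, 1.0]],
)
```

In this toy case the fused score follows the visible score at the first pixel and the thermal score at the second, which is the qualitative behaviour the abstract describes: each pixel leans on whichever modality its locality map marks as more reliable.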
