4.7 Article

Cross-Modal Object Detection Via UAV

期刊

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY
卷 72, 期 8, 页码 10894-10905

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TVT.2023.3262129

关键词

Unmanned aerial vehicles; cross modality; object detection; knowledge distillation

向作者/读者索取更多资源

This paper proposes a cross-modal knowledge distillation (CKD) enabled object detection paradigm for UAV-based target detection. It achieves comparable detection performance with multi-modal techniques while requiring less computational resources.
UAV-based object detection aims at locating and recognizing targets in aerial images, which is widely applied to traffic surveillance, disaster rescue and anomaly monitoring. However, due to expensive sensors and complicated architectures, it is unrealistic to deploy precise but heavy multi-modal object detectors into UAV nodes. To get over the dilemma, inspired by model compression and cross-modal signal processing techniques, this paper proposes a cross-modal knowledge distillation (CKD) enabled object detection paradigm, which achieves comparable detection performance with multi-modal techniques, yet requires less computational resource. On the one hand, in order to avoid transferring redundant feature knowledge, we design a Selective Feature Imitation (SFI) to selectively shorten the distance between cross-modal features according to their types. On the other hand, in order to transfer the most valuable prediction knowledge, we design an Adaptive Prediction Imitation (API). It evaluates the quality of prediction knowledge, and then adaptively adjusts the distillation intensity for cross-modal prediction. Extensive experiments on the DroneVehicle dataset have shown the performance improvement of the proposed scheme.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据