3.8 Proceedings Paper

Multimodal Object Detection via Probabilistic Ensembling

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Computer Science, Artificial Intelligence

Weighted boxes fusion: Ensembling boxes from different object detection models

Roman Solovyev et al.

Summary: This study introduces a novel method, weighted boxes fusion, for combining predictions from different object detection models, significantly improving the quality of the ensemble predicted rectangles. The method achieved top results in various datasets and challenges, with the 3D version of boxes fusion being successfully applied in winning teams of specific competitions.

IMAGE AND VISION COMPUTING (2021)

Article Computer Science, Information Systems

Bottom-up and Layerwise Domain Adaptation for Pedestrian Detection in Thermal Images

My Kieu et al.

Summary: This article explores domain adaptation approaches to adapt RGB-trained detectors to the thermal domain for pedestrian detection. Experimental evaluation shows that their bottom-up domain adaptation techniques outperform the best-performing single-modality pedestrian detection results on KAIST and outperform the state of the art on FLIR. The study proposes two new bottom-up domain adaptation strategies and highlights the importance of adapting detectors to variable lighting conditions for safety and security applications.

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS (2021)

Article Robotics

MLPD: Multi-Label Pedestrian Detector in Multispectral Domain

Jiwon Kim et al.

Summary: This study introduces a novel single-stage detection framework for multispectral pedestrian detection, which leverages multi-label learning to learn input state-aware features, and proposes a novel augmentation strategy to handle unpaired multispectral images.

IEEE ROBOTICS AND AUTOMATION LETTERS (2021)

Proceedings Paper Computer Science, Artificial Intelligence

There is More than Meets the Eye: Self-Supervised Multi-Object Detection and Tracking with Sound by Distilling Multimodal Knowledge

Francisco Rivera Valverde et al.

Summary: Sound attributes of objects help in object detection and tracking. This study proposes a self-supervised framework that leverages multiple modalities to distill knowledge into an audio student network. Experimental results show that the approach outperforms existing methods in detecting multiple objects using only sound during inference.

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Guided Attentive Feature Fusion for Multispectral Pedestrian Detection

Heng Zhang et al.

Summary: The proposed novel attentive multispectral feature fusion approach utilizes deep learning architecture to dynamically weigh and fuse multispectral features guided by inter- and intra-modality attention modules, significantly improving pedestrian detection accuracy at a low computation cost based on experiments on two public datasets.

2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

SyNet: An Ensemble Network for Object Detection in UAV Images

Berat Mert Albaba et al.

Summary: Recent advances in camera equipped drone applications have led to an increased demand for vision based object detection algorithms in aerial images. This paper proposes an ensemble network, SyNet, which combines a multi-stage method with a single-stage method to improve object detection accuracy. The results obtained on two different datasets, MS-COCO and visDrone, demonstrate the effectiveness of the proposed solution in achieving state of the art performance.

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) (2021)

Proceedings Paper Imaging Science & Photographic Technology

MULTISPECTRAL FUSION FOR OBJECT DETECTION WITH CYCLIC FUSE-AND-REFINE BLOCKS

Heng Zhang et al.

2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) (2020)

Article Computer Science, Artificial Intelligence

Cross-modality interactive attention network for multispectral pedestrian detection

Lu Zhang et al.

INFORMATION FUSION (2019)

Article Computer Science, Artificial Intelligence

Fusion of multispectral data through illumination-aware deep neural networks for pedestrian detection

Dayan Guan et al.

INFORMATION FUSION (2019)

Article Computer Science, Artificial Intelligence

Illumination-aware faster R-CNN for robust multispectral pedestrian detection

Chengyang Li et al.

PATTERN RECOGNITION (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Fully Convolutional Region Proposal Networks for Multispectral Person Detection

Daniel Koenig et al.

2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Learning non-maximum suppression

Jan Hosang et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Mask R-CNN

Kaiming He et al.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Soft-NMS - Improving Object Detection With One Line of Code

Navaneeth Bodla et al.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)

Article Computer Science, Artificial Intelligence

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2015)

Article Computer Science, Artificial Intelligence

The PASCAL Visual Object Classes Challenge: A Retrospective

Mark Everingham et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2015)

Article Computer Science, Artificial Intelligence

Pedestrian Detection: An Evaluation of the State of the Art

Piotr Dollar et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2012)