4.5 Article

On-Board Small-Scale Object Detection for Unmanned Aerial Vehicles (UAVs)

Journal

DRONES
Volume 7, Issue 5, Pages -

Publisher

MDPI
DOI: 10.3390/drones7050310

Keywords

object detection; computer vision; modified CenterNet; VisDrone; SSD; AU-AIR; object detectors; Jetson Xavier; Jetson Nano; NCS2

Categories

Ask authors/readers for more resources

Object detection is challenging with aerial images due to small target sizes, low resolution, occlusion, attitude, and scale variations. We modified the CenterNet architecture and used different CNN-based backbones to improve performance. The modified CenterNet achieved promising results on challenging datasets and was compared with other popular object detectors. The approach was also optimized and implemented on edge platforms, showing good performance compared to the latest cutting-edge research on both discrete GPU and edge platforms.
Object detection is a critical task that becomes difficult when dealing with onboard detection using aerial images and computer vision technique. The main challenges with aerial images are small target sizes, low resolution, occlusion, attitude, and scale variations, which affect the performance of many object detectors. The accuracy of the detection and the efficiency of the inference are always trade-offs. We modified the architecture of CenterNet and used different CNN-based backbones of ResNet18, ResNet34, ResNet50, ResNet101, ResNet152, Res2Net50, Res2Net101, DLA-34, and hourglass14. A comparison of the modified CenterNet with nine CNN-based backbones is conducted and validated using three challenging datasets, i.e., VisDrone, Stanford Drone dataset (SSD), and AU-AIR. We also implemented well-known off-the-shelf object detectors, i.e., YoloV1 to YoloV7, SSD-MobileNet-V2, and Faster RCNN. The proposed approach and state-of-the-art object detectors are optimized and then implemented on cross-edge platforms, i.e., NVIDIA Jetson Xavier, NVIDIA Jetson Nano, and Neuro Compute Stick 2 (NCS2). A detailed comparison of performance between edge platforms is provided. Our modified CenterNet combination with hourglass as a backbone achieved 91.62%, 75.61%, and 34.82% mAP using the validation sets of AU-AIR, SSD, and VisDrone datasets, respectively. An FPS of 40.02 was achieved using the ResNet18 backbone. We also compared our approach with the latest cutting-edge research and found promising results for both discrete GPU and edge platforms.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available