4.7 Article

CT-Net: An Efficient Network for Low-Altitude Object Detection Based on Convolution and Transformer

Journal

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TIM.2022.3165838

Keywords

Convolutional neural network (CNN); low-altitude; object detection; transformer; unmanned aerial vehicles (UAVs)

Funding

  1. Foundation of the State Key Laboratory of Coal Mining and Clean Utilization, China [2021CMCUKF012]
  2. National Natural Science Foundation of China [52121003]
  3. Fundamental Research Funds for the Central Universities [2022YJSJD01, 2022YQJD04]

Ask authors/readers for more resources

In this article, a deep learning method called CT-Net is proposed for low-altitude small-object detection. It addresses the limitations of existing detection methods in accuracy, model size, and speed through the introduction of an attention-enhanced transformer block, a lightweight bottleneck module, and a directional feature fusion structure. Experimental results show that CT-Net outperforms other detectors on low-altitude small-object datasets and MS COCO.
With the growing popularity of civilian unmanned aerial vehicles (UAVs), unauthorized flights are on the rise accordingly. Therefore, it is critical to detect low-altitude UAVs for protecting personal privacy and public safety. Though substantial progress has been made in UAV detection, the existing detection methods still have problems in balancing the detection accuracy, the model size, and the detection speed. To address these limitations, this article proposes a novel deep learning method named convolution-transformer network (CT-Net). First, the attention-enhanced transformer block (AETB), which builds a feature-enhanced multihead self-attention (FEMSA), is introduced into the backbone of the network to improve the feature extraction ability of the model. Then, a lightweight bottleneck module (LBM) is utilized to control the computation load and reduce the parameters. Finally, we present a directional feature fusion structure (DFFS) to improve the accuracy of detection when processing multiscale objects, especially small-size objects. The proposed scheme achieves 0.966 mAP with the input size of 640 x 640 pixels on our low-altitude small-object dataset, and the performance is superior to YOLOv5. Furthermore, the experimental result on MS COCO shows that the CT-Net can outperform the current state-of-the-art detectors by a large margin. Thus, it is feasible to apply CT-Net to low-altitude small-object detection according to the experimental results.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available