4.7 Article

An Improved Discriminative Model Prediction Approach to Real-Time Tracking of Objects With Camera as Sensors

期刊

IEEE SENSORS JOURNAL
卷 21, 期 15, 页码 17308-17317

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JSEN.2021.3079214

关键词

Visualization; Semantics; Training data; Predictive models; Sensor fusion; Cameras; Feature extraction; Model prediction; data augmentation; feature fusion; person tracking; visual sensing

资金

  1. National Natural Science Foundation of China [61305014]
  2. Shanghai Municipal Education Commission
  3. Shanghai Education Development Foundation under Chen Guang Project [13CG6013CG60]
  4. Ministry of Education
  5. King Abdulaziz University, Jeddah, Saudi Arabia
  6. [IFPNC-001-135-2020]

向作者/读者索取更多资源

This study proposes an enhanced discriminative model prediction method using efficient data augmentation and feature fusion to improve person tracking performance. The proposed tracker achieves real-time discrimination of objects in complex scenarios through multi-layer feature fusion and effective data augmentation strategy.
Generic person tracking is a basic task in visual surveillance by using camera as sensors. Many deep learning-based trackers have obtained outstanding performance. Among them, trackers based on Siamese networks have drawn great attention and are promising. These training methods are competitive but training data can be more effectively augmented to improve their person tracking performance. Many other trackers use only one layer to extract semantic features, likely hindering their discriminative learning. In this paper, we propose an enhanced discriminative model prediction method with efficient data augmentation and robust feature fusion. Specifically, we propose to implement an effective data augmentation strategy (e.g., color jitter and motion blur) to unleash the greater potential of original training data. We also adopt a multi-layer feature fusion to obtain a more discriminative feature map. Thus, the proposed tracker can discriminate an object in complicated scenarios in real time. We conduct extensive experiments on two datasets, i.e., VOT2018 and UAV123. Objective evaluation on VOT2018 demonstrates that with its expected average overlap value of 0.430, it outperforms a state-of-the-art tracker by 4.88%. On UAV123, it does so by 4.5% in success rate and 4.4% in precision rate. In addition, our further experimental results reveal that our algorithm can reach a speed that is high enough to meet the real-time tracking requirement when camera are used as sensors.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据