4.7 Article

An Improved Discriminative Model Prediction Approach to Real-Time Tracking of Objects With Camera as Sensors

Journal

IEEE SENSORS JOURNAL
Volume 21, Issue 15, Pages 17308-17317

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JSEN.2021.3079214

Keywords

Visualization; Semantics; Training data; Predictive models; Sensor fusion; Cameras; Feature extraction; Model prediction; data augmentation; feature fusion; person tracking; visual sensing

Funding

  1. National Natural Science Foundation of China [61305014]
  2. Shanghai Municipal Education Commission
  3. Shanghai Education Development Foundation under Chen Guang Project [13CG6013CG60]
  4. Ministry of Education
  5. King Abdulaziz University, Jeddah, Saudi Arabia
  6. [IFPNC-001-135-2020]

Ask authors/readers for more resources

This study proposes an enhanced discriminative model prediction method using efficient data augmentation and feature fusion to improve person tracking performance. The proposed tracker achieves real-time discrimination of objects in complex scenarios through multi-layer feature fusion and effective data augmentation strategy.
Generic person tracking is a basic task in visual surveillance by using camera as sensors. Many deep learning-based trackers have obtained outstanding performance. Among them, trackers based on Siamese networks have drawn great attention and are promising. These training methods are competitive but training data can be more effectively augmented to improve their person tracking performance. Many other trackers use only one layer to extract semantic features, likely hindering their discriminative learning. In this paper, we propose an enhanced discriminative model prediction method with efficient data augmentation and robust feature fusion. Specifically, we propose to implement an effective data augmentation strategy (e.g., color jitter and motion blur) to unleash the greater potential of original training data. We also adopt a multi-layer feature fusion to obtain a more discriminative feature map. Thus, the proposed tracker can discriminate an object in complicated scenarios in real time. We conduct extensive experiments on two datasets, i.e., VOT2018 and UAV123. Objective evaluation on VOT2018 demonstrates that with its expected average overlap value of 0.430, it outperforms a state-of-the-art tracker by 4.88%. On UAV123, it does so by 4.5% in success rate and 4.4% in precision rate. In addition, our further experimental results reveal that our algorithm can reach a speed that is high enough to meet the real-time tracking requirement when camera are used as sensors.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available