4.6 Article

Real-time and robust visual tracking with scene-perceptual memory

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.jvcir.2023.103825

Keywords

Visual tracking; Correlation filter; Scene-perceptual memory; Unmanned aerial vehicle; Aerial object tracking

Ask authors/readers for more resources

Unmanned aerial vehicle (UAV) based aerial visual tracking is a research hotspot, but mainstream UAV trackers have two shortcomings: accuracy-speed trade-off and restriction from object occlusion and camera motion. To address these, a Fast-AutoTrack tracker based on scene-perceptual memory is proposed, which uses a confidence score to perceive and judge tracking anomalies and predicts search regions for object re-detection. The model updating is accelerated using the perceptual hashing algorithm. On aerial tracking datasets, Fast-AutoTrack outperforms AutoTrack in terms of speed while maintaining similar accuracy, demonstrating a favorable accuracy-speed trade-off.
Unmanned aerial vehicle (UAV) based aerial visual tracking is one of the research hotspots in computer vision. However, the mainstream trackers for UAV still have two shortcomings: (1) the accuracy of correlation filter tracker is greatly improved with more complex model, it impedes accuracy-speed trade-off. (2) object occlusion and camera motion in the aerial tracking scene also seriously restrict the application of aerial tracking. To address these problems, and inspired by AutoTrack tracker, we propose an aerial correlation filtering tracker based on scene-perceptual memory, Fast-AutoTrack. Firstly, to perceive and judge tracking anomalies, such as object occlusion and camera motion, inspired by the peak sidelobe ratio and AutoTrack, a confidence score is designed by perceiving and remembering the changing trend of the confidence and the local historical confidence. Secondly, after tracking anomaly occurring, several search regions are predicted based on the local object motion trend and the Spatio-temporal context information for object re-detection. Finally, to accelerate the model updating, the perceptual hashing algorithm (PHA) is used to obtain the similarity of the search regions between two adjacent frames. On typical aerial tracking datasets UAVDT, UAV123@10fps, and DTB70, Fast-AutoTrack run 71.4% faster than AutoTrack with almost equal accuracy and show favorable accuracy-speed trade-off.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available