4.7 Article

SSPNet: Learning spatiotemporal saliency prediction networks for visual tracking q

Journal

INFORMATION SCIENCES
Volume 575, Issue -, Pages 399-416

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2021.06.042

Keywords

Visual tracking; Spatiotemporal feature; 3D convolutional neural networks; Recurrent neural networks

Funding

  1. National Research Foundation of Korea (NRF) - Korean Government (MSIT) [2019R1F1A1061941]
  2. National Research Foundation of Korea [2019R1F1A1061941] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

Ask authors/readers for more resources

The novel method SSPNet improves visual tracking accuracy by predicting the spatiotemporal features of the target, addressing limitations of traditional methods, particularly excelling in challenging sequences.
We present SSPNet, a novel method for learning the spatiotemporal saliency of a target for visual tracking. State-of-the-art trackers typically track targets by predicting the target state, ie coordinates of a bounding box encompassing the target, from the target candidates sampled around a previous target state. However, they have two limitations: 1) vulnerability to tracking distractors present in a frame and 2) the strong bias of the target state estimation to the initial frame. The proposed method addresses this problem by predicting the spatiotemporal features of the target so-called current and future target saliencies. Given a frame, the current target saliency represents the spatial aspect of the target, whereas the future target saliency depicts how the target will appear in the next frame based on the temporal features. This technique improves tracking accuracy in two ways: we can exploit the similarity between the current and the future target saliencies to detect the distractors. Further, SSPNet provides better prior knowledge about the current target state compared to using only the previous frame, which mitigates the bias to the initial frame and occlusion problem. We show that SSPNet outperforms the state-of-the-art trackers, particularly in challenging sequences. (c) 2021 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available