☆ 4.7 Article

SSPNet: Learning spatiotemporal saliency prediction networks for visual tracking q

INFORMATION SCIENCES (2021)

Journal

INFORMATION SCIENCES

Volume 575, Issue -, Pages 399-416

Publisher

ELSEVIER SCIENCE INC

DOI: 10.1016/j.ins.2021.06.042

Keywords

Visual tracking; Spatiotemporal feature; 3D convolutional neural networks; Recurrent neural networks

Funding

National Research Foundation of Korea (NRF) - Korean Government (MSIT) [2019R1F1A1061941]
National Research Foundation of Korea [2019R1F1A1061941] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

The novel method SSPNet improves visual tracking accuracy by predicting the spatiotemporal features of the target, addressing limitations of traditional methods, particularly excelling in challenging sequences.

We present SSPNet, a novel method for learning the spatiotemporal saliency of a target for visual tracking. State-of-the-art trackers typically track targets by predicting the target state, ie coordinates of a bounding box encompassing the target, from the target candidates sampled around a previous target state. However, they have two limitations: 1) vulnerability to tracking distractors present in a frame and 2) the strong bias of the target state estimation to the initial frame. The proposed method addresses this problem by predicting the spatiotemporal features of the target so-called current and future target saliencies. Given a frame, the current target saliency represents the spatial aspect of the target, whereas the future target saliency depicts how the target will appear in the next frame based on the temporal features. This technique improves tracking accuracy in two ways: we can exploit the similarity between the current and the future target saliencies to detect the distractors. Further, SSPNet provides better prior knowledge about the current target state compared to using only the previous frame, which mitigates the bias to the initial frame and occlusion problem. We show that SSPNet outperforms the state-of-the-art trackers, particularly in challenging sequences. (c) 2021 Elsevier Inc. All rights reserved.

SSPNet: Learning spatiotemporal saliency prediction networks for visual tracking q

Journal

INFORMATION SCIENCES

Publisher

ELSEVIER SCIENCE INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

SSPNet: Learning spatiotemporal saliency prediction networks for visual tracking q

Journal

INFORMATION SCIENCES

Publisher

ELSEVIER SCIENCE INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper