Article

Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-Based Beam Search

IEEE TRANSACTIONS ON IMAGE PROCESSING (2022)

Journal

IEEE TRANSACTIONS ON IMAGE PROCESSING

Volume 31, Issue -, Pages 6239-6254

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TIP.2022.3208437

Keywords

Target tracking; Tracking; Visualization; Search problems; Reinforcement learning; Trajectory; Decision making; Visual tracking; multi-agent reinforcement learning; beam search; local and global search; greedy search

Categories

Computer Science, Artificial Intelligence; Engineering, Electrical & Electronic

Funding

National Natural Science Foundation of China [62102205, 62076003]
University Synergy Innovation Program of Anhui Province [GXXT-2021-038]
Major Project for New Generation of AI [2018AAA0100400]
Anhui Provincial Key Research and Development Program [2022i01020014]
Australian Research Council (ARC) [IH180100002]

AI Summary

This paper proposes BeamTracking, a multi-agent reinforcement learning-based beam search tracking strategy that addresses the limitations of traditional visual trackers in challenging scenarios. By maintaining multiple tracking trajectories and applying beam search, it reduces accumulated errors and improves tracking accuracy and reliability.

Abstract

To track a target in a video, current visual trackers usually adopt greedy search for target localization in each frame; that is, the candidate region with the maximum response score is selected as the tracking result of that frame. However, we found that this may not be an optimal choice, especially in challenging tracking scenarios such as heavy occlusion and fast motion. In particular, if a tracker drifts, errors accumulate and make the response scores estimated by the tracker unreliable in future frames. To address this issue, we propose to maintain multiple tracking trajectories and apply a beam search strategy for visual tracking, so that the trajectory with fewer accumulated errors can be identified. Accordingly, this paper introduces a novel multi-agent reinforcement learning-based beam search tracking strategy, termed BeamTracking. It is mainly inspired by the image captioning task, which takes an image as input and generates diverse descriptions using the beam search algorithm. We formulate tracking as a sample selection problem fulfilled by multiple parallel decision-making processes, each of which aims to pick one sample as its tracking result in each frame. Each maintained trajectory is associated with an agent that performs the decision-making and determines what actions should be taken to update related information. More specifically, using a classification-based tracker as the baseline, we first adopt a bi-GRU to encode the target feature, the proposal feature, and its response score into a unified state representation. The state feature and the greedy search result are then fed into the first agent for independent action selection. Afterwards, the output action and state features are fed into the subsequent agent to predict diverse results. When all frames have been processed, we select the trajectory with the maximum accumulated score as the tracking result. Extensive experiments on seven popular tracking benchmark datasets validated the effectiveness of the proposed algorithm.
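The contrast between greedy search and beam search described in the abstract can be illustrated with a small toy example. This is only a sketch under stated assumptions and not the BeamTracking method itself: there are no learned agents, no bi-GRU state encoding, and no reinforcement learning. The names `beam_track`, `step_score`, and `FRAMES`, the 1-D positions, and the hand-written motion penalty are all hypothetical; the penalty is added only so that a candidate's score depends on the trajectory so far, which is the situation where keeping several trajectories can pay off.

```python
from typing import List, Tuple

# Hypothetical candidate format: (1-D position, response score).
Candidate = Tuple[float, float]


def step_score(prev_pos: float, cand: Candidate, motion_penalty: float) -> float:
    """Assumed scoring rule: response score minus a penalty for large jumps."""
    pos, response = cand
    return response - motion_penalty * abs(pos - prev_pos)


def beam_track(
    frames: List[List[Candidate]],
    start_pos: float,
    beam_width: int,
    motion_penalty: float = 0.2,
) -> Tuple[List[float], float]:
    """Keep the `beam_width` best trajectories by accumulated score.

    With beam_width=1 this reduces to greedy search: only the single
    best-scoring candidate survives in every frame.
    """
    beams = [([start_pos], 0.0)]  # (positions so far, accumulated score)
    for candidates in frames:
        expanded = []
        for path, total in beams:
            for cand in candidates:
                gain = step_score(path[-1], cand, motion_penalty)
                expanded.append((path + [cand[0]], total + gain))
        # Prune to the highest-scoring trajectories.
        expanded.sort(key=lambda beam: beam[1], reverse=True)
        beams = expanded[:beam_width]
    best_path, best_total = beams[0]
    return best_path[1:], best_total  # drop the start position


# Toy sequence: the target moves left (-1, -2, -3) while a distractor moves
# right (1, 2, 3). The distractor has the higher response in the first frame
# only, and then fades.
FRAMES: List[List[Candidate]] = [
    [(-1.0, 0.6), (1.0, 0.8)],
    [(-2.0, 0.9), (2.0, 0.6)],
    [(-3.0, 0.9), (3.0, 0.4)],
]

greedy_path, greedy_total = beam_track(FRAMES, start_pos=0.0, beam_width=1)
beam_path, beam_total = beam_track(FRAMES, start_pos=0.0, beam_width=2)

print("greedy:", greedy_path, round(greedy_total, 2))
print("beam:  ", beam_path, round(beam_total, 2))
```

In this toy sequence the greedy tracker locks onto the distractor in the first frame and never recovers, because jumping back to the target is penalized more than staying on the drifted trajectory. With a beam width of 2, the lower-ranked trajectory survives the first frame and ends with the higher accumulated score, which mirrors the final selection step in the abstract.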
