4.6 Article

OneShotDA: Online Multi-Object Tracker With One-Shot-Learning-Based Data Association

期刊

IEEE ACCESS
卷 8, 期 -, 页码 38060-38072

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2020.2975912

关键词

Trajectory; Proposals; Feature extraction; Video sequences; Object recognition; Benchmark testing; Task analysis; Data association; deep learning; multi-object tracking; object recognition; one-shot learning

资金

  1. Institute of Information and Communications Technology Planning and Evaluation (IITP) - Korean Government (MSIT) [2014-3-00077]
  2. AI National Strategy Project
  3. National Research Foundation of Korea (NRF) - Korean Government (MSIT) [2019R1A2C2087489, 2017R1D1A1B03036423]
  4. Institute for Information & Communication Technology Planning & Evaluation (IITP), Republic of Korea [2014-3-00077-007] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)
  5. National Research Foundation of Korea [2017R1D1A1B03036423, 2019R1A2C2087489] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

向作者/读者索取更多资源

Tracking multiple objects in a video sequence can be accomplished by identifying the objects appearing in the sequence and distinguishing between them. Therefore, many recent multi-object tracking (MOT) methods have utilized re-identification and distance metric learning to distinguish between objects by computing the similarity/dissimilarity scores. However, it is difficult to generalize such approaches for arbitrary video sequences, because some important information, such as the number of objects (classes) in a video, is not known in advance. Therefore, in this study, we applied a one-shot learning framework to the MOT problem. Our algorithm tracks objects by classifying newly observed objects into existing tracks, irrespective of the number of objects appearing in a video frame. The proposed method, called OneShotDA, exploits the one-shot learning framework based on an attention mechanism. Our neural network learns to classify unseen data samples using labels from a support set. Once the network has been trained, it predicts correct labels for newly received detection results based on the set of existing tracks. To analyze the effectiveness of our method, it was tested on the MOTchallenge benchmark datasets (MOT16 and MOT17 datasets). The results reveal that the performance of the proposed method was comparable with those of current state-of-the-art methods. In particular, it is noteworthy that the proposed method ranked first among the online trackers on the MOT17 benchmark.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据