3.8 Proceedings Paper

ByteTrack: Multi-object Tracking by Associating Every Detection Box

Related references

Note: Only part of the references are listed.
Article Computer Science, Information Systems

RelationTrack: Relation-Aware Multiple Object Tracking With Decoupled Representation

En Yu et al.

Summary: Existing online MOT algorithms typically involve detection and re-identification (ReID) as two subtasks. In order to improve efficiency, these subtasks are commonly integrated into a unified framework. However, since detection and ReID require different features, this creates an optimization contradiction during training. To address this issue, we propose the Global Context Disentangling (GCD) module to separate the learned representation into detection-specific and ReID-specific embeddings. Additionally, we develop the Guided Transformer Encoder (GTE) module to capture global semantic relations more effectively, improving the overall performance of the MOT framework.

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

Article Computer Science, Artificial Intelligence

TransCenter: Transformers With Dense Representations for Multiple-Object Tracking

Yihong Xu et al.

Summary: Transformers have shown excellent performance in various tasks, but there is still room for improvement in multiple-object tracking. This paper proposes TransCenter, a transformer-based architecture that accurately tracks all objects using dense representations while maintaining reasonable runtime. TransCenter achieves remarkable performance improvements and outperforms state-of-the-art methods in two standard MOT benchmarks by utilizing dense image-related detection queries and efficient sparse tracking queries.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Article Computer Science, Artificial Intelligence

VoxelTrack: Multi-Person 3D Human Pose Estimation and Tracking in the Wild

Yifu Zhang et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Proceedings Paper Computer Science, Artificial Intelligence

MOTR: End-to-End Multiple-Object Tracking with Transformer

Fangao Zeng et al.

Summary: This paper proposes MOTR, an extension of DETR that models tracked instances in the entire video using track queries and incorporates techniques such as tracklet-aware label assignment, temporal aggregation network, and collective average loss to enhance temporal relation modeling. Experimental results demonstrate that MOTR achieves significant improvements in multiple-object tracking.

COMPUTER VISION - ECCV 2022, PT XXVII (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Global Tracking Transformers

Xingyi Zhou et al.

Summary: This paper presents a novel transformer-based architecture for global multi-object tracking. The proposed architecture achieves global multi-object tracking without intermediate pairwise grouping or combinatorial association. Experimental results demonstrate that the architecture achieves competitive performance on the MOT17 benchmark and can seamlessly integrate into large-vocabulary detectors.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

TrackFormer: Multi-Object Tracking with Transformers

Tim Meinhardt et al.

Summary: This study proposes an end-to-end trainable multi-object tracking approach called TrackFormer, based on an encoder-decoder Transformer architecture. TrackFormer achieves outstanding performance in track initialization, identity, and spatio-temporal trajectory reasoning, and introduces the attention mechanism. Through self- and encoder-decoder attention on global frame-level features, additional graph optimization or modeling of motion and/or appearance is omitted.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

MeMOT: Multi-Object Tracking with Memory

Jiarui Cai et al.

Summary: This study proposes an online tracking algorithm that utilizes a large spatio-temporal memory to link objects after a long time span. The algorithm achieves excellent performance in object detection and data association tasks.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2022)

Article Computer Science, Artificial Intelligence

HOTA: A Higher Order Metric for Evaluating Multi-object Tracking

Jonathon Luiten et al.

Summary: The higher order tracking accuracy (HOTA) is a novel evaluation metric for multi-object tracking that balances accurate detection, association, and localization. It decomposes into sub-metrics to evaluate different error types separately, providing clear analysis of tracking performance. The HOTA scores align better with human visual evaluation of tracking performance compared to established metrics.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2021)

Article Computer Science, Artificial Intelligence

ReMOT: A model-agnostic refinement for multiple object tracking

Fan Yang et al.

Summary: The refinement method was studied for Multiple Object Tracking (MOT) tasks, defining Mix-up Error and Cut-off Error in imperfect tracklets, proposing the ReMOT framework to improve appearance features by splitting and merging tracklets, significantly improving MOT results, and assisting semi-automatic MOT data annotation.

IMAGE AND VISION COMPUTING (2021)

Article Computer Science, Artificial Intelligence

FairMOT: On the Fairness of Detection and Re-identification in Multiple Object Tracking

Yifu Zhang et al.

Summary: Multi-object tracking is a crucial problem in computer vision, and formulating it as multi-task learning of object detection and re-ID in a single network can lead to joint optimization of the two tasks. However, competition between the tasks needs to be addressed, and the proposed FairMOT method based on CenterNet architecture achieves high accuracy for both detection and tracking through detailed designs and empirical studies.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Detecting Invisible People

Tarasha Khurana et al.

Summary: The research introduces a new approach to detecting invisible objects, focusing on the case of people. By treating occluded object detection as a short-term forecasting challenge and building dynamic models, the tracking and detection of occluded objects is achieved. The performance improvement is significant compared to existing technologies.

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Learning to Track with Object Permanence

Pavel Tokmakov et al.

Summary: Traditional online multi-object tracking methods heavily rely on instantaneous observations, making them prone to failure when objects are not fully visible. In contrast, humans track objects based on the concept of object permanence. This study introduces an end-to-end trainable approach for joint object detection and tracking, which utilizes a large synthetic dataset for training and improves model robustness against occlusions.

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) (2021)

Proceedings Paper Automation & Control Systems

Joint Object Detection and Multi-Object Tracking with Graph Neural Networks

Yongxin Wang et al.

Summary: Object detection and data association are critical components in multi-object tracking systems. Recent works have shown that simultaneously optimizing detection and data association modules under a joint MOT framework can lead to improved performance. This study proposes a new instance of joint MOT approach based on Graph Neural Networks, which can model relations between variable-sized objects in both spatial and temporal domains, leading to state-of-the-art performance for both detection and MOT tasks.

2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Improving Multiple Object Tracking with Single Object Tracking

Linyu Zheng et al.

Summary: This paper proposes a novel and end-to-end trainable MOT architecture that extends CenterNet by adding an SOT branch for tracking objects in parallel. This allows the MOT task to benefit from the strong discriminative power of SOT methods in an effective and efficient way, achieving high performance with a frame rate of 16 FPS on MOT17.

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 (2021)

Proceedings Paper Computer Science, Artificial Intelligence

SiamMOT: Siamese Multi-Object Tracking

Bing Shuai et al.

Summary: This paper introduces a region-based Siamese Multi-Object Tracking network called SiamMOT to improve online multi-object tracking by modeling motion. Experimental results show that SiamMOT performs superiorly in MOT and outperforms the winners of ACM MM'20 HiEve Grand Challenge on the HiEve dataset.

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 (2021)

Article Computer Science, Artificial Intelligence

Object Detection in Videos by High Quality Object Linking

Peng Tang et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2020)

Article Computer Science, Information Systems

Multiplex Labeling Graph for Near-Online Tracking in Crowded Scenes

Yang Zhang et al.

IEEE INTERNET OF THINGS JOURNAL (2020)

Proceedings Paper Engineering, Electrical & Electronic

Online Multi-Object Tracking with Instance-Aware Tracker and Dynamic Model Refreshment

Peng Chu et al.

2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) (2019)

Proceedings Paper Computer Science, Artificial Intelligence

CityPersons: A Diverse Dataset for Pedestrian Detection

Shanshan Zhang et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Mask R-CNN

Kaiming He et al.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Exploit All the Layers: Fast and Accurate CNN Object Detector with Scale Dependent Pooling and Cascaded Rejection Classifiers

Fan Yang et al.

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2016)

Article Computer Science, Artificial Intelligence

Continuous Energy Minimization for Multitarget Tracking

Anton Milan et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2014)

Proceedings Paper Computer Science, Artificial Intelligence

Robust Online Multi-Object Tracking based on Tracklet Confidence and Online Discriminative Appearance Learning

Seung-Hwan Bae et al.

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2014)

Proceedings Paper Computer Science, Artificial Intelligence

The Way They Move: Tracking Multiple Targets with Similar Appearance

Caglayan Dicle et al.

2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2013)

Article Engineering, Electrical & Electronic

Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics

Keni Bernardin et al.

EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING (2008)