4.7 Article

Online Multi-Face Tracking With Multi-Modality Cascaded Matching

Related references

Note: Only part of the references are listed.
Article Computer Science, Artificial Intelligence

Distilled Siamese Networks for Visual Tracking

Jianbing Shen et al.

Summary: This paper introduces a distilled Siamese tracking framework, which learns small, fast, and accurate trackers through a teacher-student knowledge distillation model from large Siamese trackers. The proposed framework achieves high compression rates and frame rates while maintaining tracking accuracy by utilizing teacher-student distillation and student-student knowledge sharing.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Article Computer Science, Artificial Intelligence

HOTA: A Higher Order Metric for Evaluating Multi-object Tracking

Jonathon Luiten et al.

Summary: The higher order tracking accuracy (HOTA) is a novel evaluation metric for multi-object tracking that balances accurate detection, association, and localization. It decomposes into sub-metrics to evaluate different error types separately, providing clear analysis of tracking performance. The HOTA scores align better with human visual evaluation of tracking performance compared to established metrics.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2021)

Review Computer Science, Artificial Intelligence

Multiple object tracking: A literature review

Wenhan Luo et al.

Summary: This review comprehensively examines the problem of Multiple Object Tracking (MOT) and proposes interesting directions for future research. By analyzing existing methods and experimental results, some fundamental agreements in the field have been verified.

ARTIFICIAL INTELLIGENCE (2021)

Article Engineering, Electrical & Electronic

A Survey of Multiple Pedestrian Tracking Based on Tracking-by-Detection Framework

Zhihong Sun et al.

Summary: This paper provides a comprehensive survey of recent advances in TBD-based MPT algorithms, analyzing existing algorithms systematically and organizing the survey into four major parts. It covers milestones of TBD-based works, main procedures of the TBD framework, performance evaluation on MOT challenge datasets, and discussions on open issues and future directions in the TBD framework.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2021)

Article Computer Science, Artificial Intelligence

FairMOT: On the Fairness of Detection and Re-identification in Multiple Object Tracking

Yifu Zhang et al.

Summary: Multi-object tracking is a crucial problem in computer vision, and formulating it as multi-task learning of object detection and re-ID in a single network can lead to joint optimization of the two tasks. However, competition between the tasks needs to be addressed, and the proposed FairMOT method based on CenterNet architecture achieves high accuracy for both detection and tracking through detailed designs and empirical studies.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2021)

Article Computer Science, Artificial Intelligence

Recent advances of single-object tracking methods: A brief survey

Yucheng Zhang et al.

Summary: This paper summarizes single-object tracking algorithms based on correlation filters and deep learning, explaining the definition, components, and development trends of these algorithms over the past decade.

NEUROCOMPUTING (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Face, Body, Voice: Video Person-Clustering with Multiple Modalities

Andrew Brown et al.

Summary: The goal of this work is person-clustering in videos, where characters are grouped based on their identities. The study introduces a Multi-Modal High-Precision Clustering algorithm and a Video Person-Clustering dataset, demonstrating the effectiveness of using multiple modalities for person-clustering and exploring the application of this task for story understanding.

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Partially View-aligned Representation Learning with Noise-robust Contrastive Loss

Mouxing Yang et al.

Summary: The proposed method addresses the Partially View-aligned Problem by simultaneously learning representation and aligning data using a noise-robust contrastive loss. It constructs positive and negative pairs, and uses a loss function to prevent false negatives, resulting in successful application in multi-view clustering and classification tasks with promising performance.

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 (2021)

Article Computer Science, Artificial Intelligence

Tracking Persons-of-Interest via Unsupervised Representation Adaptation

Shun Zhang et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2020)

Article Computer Science, Artificial Intelligence

Self-supervised on-line cumulative learning from video streams

Federico Pernici et al.

COMPUTER VISION AND IMAGE UNDERSTANDING (2020)

Article Automation & Control Systems

Visual Object Tracking by Hierarchical Attention Siamese Network

Jianbing Shen et al.

IEEE TRANSACTIONS ON CYBERNETICS (2020)

Article Engineering, Electrical & Electronic

Deep Continuous Conditional Random Fields With Asymmetric Inter-Object Constraints for Online Multi-Object Tracking

Hui Zhou et al.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2019)

Article Automation & Control Systems

Multiobject Tracking by Submodular Optimization

Jianbing Shen et al.

IEEE TRANSACTIONS ON CYBERNETICS (2019)

Article Engineering, Electrical & Electronic

Heterogeneous Association Graph Fusion for Target Association in Multiple Object Tracking

Hao Sheng et al.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2019)

Article Engineering, Electrical & Electronic

Iterative Multiple Hypothesis Tracking With Tracklet-Level Association

Hao Sheng et al.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Language Features Matter: Effective Language Representations for Vision-Language Tasks

Andrea Burns et al.

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) (2019)

Article Engineering, Civil

Fast Online Tracking With Detection Refinement

Jianbing Shen et al.

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS (2018)

Proceedings Paper Computer Science, Artificial Intelligence

Memory Based Online Learning of Deep Representations from Video Streams

Federico Pernici et al.

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2018)

Proceedings Paper Computer Science, Artificial Intelligence

Fusion of Head and Full-Body Detectors for Multi-Object Tracking

Roberto Henschel et al.

PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW) (2018)

Proceedings Paper Computer Science, Artificial Intelligence

VGGFace2: A dataset for recognising faces across pose and age

Qiong Cao et al.

PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018) (2018)

Proceedings Paper Computer Science, Artificial Intelligence

SphereFace: Deep Hypersphere Embedding for Face Recognition

Weiyang Liu et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Tracking Persons-of-Interest via Adaptive Discriminative Features

Shun Zhang et al.

COMPUTER VISION - ECCV 2016, PT V (2016)

Proceedings Paper Computer Science, Artificial Intelligence

MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition

Yandong Guo et al.

COMPUTER VISION - ECCV 2016, PT III (2016)

Article Computer Science, Artificial Intelligence

Tracking-Learning-Detection

Zdenek Kalal et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2012)

Article Engineering, Electrical & Electronic

Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics

Keni Bernardin et al.

EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING (2008)