4.8 Article

Resource-Efficient Visual Multiobject Tracking on Embedded Device

Journal

IEEE INTERNET OF THINGS JOURNAL
Volume 9, Issue 11, Pages 8531-8543

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2021.3115102

Keywords

Appearance representation; deep learning; edge video analytics; Internet of Things (IoT); multiobject tracking (MOT)

Funding

  1. National Key Research and Development Program of China [2018YFB1702100]
  2. Natural Science Foundation of China [62025305, 61933009]
  3. Program of Shanghai Academic Research Leader [19XD1421800]

Ask authors/readers for more resources

This article proposes a resource-efficient MOT method for IoT embedded devices, achieving real-time surveillance with low latency through optimization strategies and model compression.
Multiobject tracking (MOT) is a crucial technology for security surveillance, which is computationally intensive due to the requirement of processing a large number of video streams within low latency in practice. The input video streams of MOT are processed on a cloud computing center with abundant computational capability, posing heavy pressures on delivering video streams to the cloud. Recent advances in the Internet-of-Things (IoT) technology provide edge-computing-based solutions for video analytics at scale. However, the gap between MOT's high computational capability demand and IoT devices' resource-constrained nature remains significant. In this article, a resource-efficient MOT (REMOT) method is proposed for real-time surveillance on IoT embedded devices, including an affinity measurement based on an appearance model with angular triplet loss and a motion association that substitutes the time-consuming graph-based data association stage. Considering the tradeoff between latency and accuracy, we design an optimization strategy on the parallel processing of deep learning models' layers to accelerate the inference speed with less accuracy loss. Besides, we employ a model compression strategy for model size reduction. Experiments on MOT16 and MOT17 benchmarks demonstrate that REMOT reduces 2.4x latency compared with the original implementation and achieves a running speed of 81 frames per second (fps) on an embedded device with only a marginal accuracy loss (6 %), which meets the requirements of real-time processing and low-latency response for surveillance.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available