4.7 Article

Artificial Intelligence of Things-assisted two-stream neural network for anomaly detection in surveillance Big Video Data

Publisher

ELSEVIER
DOI: 10.1016/j.future.2021.10.033

Keywords

Anomaly detection; Two-stream network; Anomaly recognition; Surveillance videos

Funding

  1. National Research Foundation of Korea (NRF) - Korea government (MSIT) [2019R1A2B5B01070067]

Ask authors/readers for more resources

This work presents an efficient and robust framework for recognizing anomalies in surveillance Big Video Data (BVD) using Artificial Intelligence of Things (AIoT). The framework consists of two phases: instant anomaly detection and detailed anomaly analysis. A two-stream neural network is proposed to model and classify the anomalies. Extensive experiments show that the framework outperforms state-of-the-art methods in terms of accuracy.
In the last few years, visual sensors are deployed almost everywhere, generating a massive amount of surveillance video data in smart cities that can be inspected intelligently to recognize anomalous events. In this work, we present an efficient and robust framework to recognize anomalies from surveillance Big Video Data (BVD) using Artificial Intelligence of Things (AIoT). Smart surveillance is an important application of AIoT and we propose a two-stream neural network in this direction. The first stream comprises instant anomaly detection that is functional over resource-constrained IoT devices, whereas second phase is a two-stream deep neural network allowing for detailed anomaly analysis, suited to be deployed as a cloud computing service. Firstly, a self-pruned fine-tuned lightweight convolutional neural network (CNN) classifies the ongoing events as normal or anomalous in an AIoT environment. Upon anomaly detection, the edge device alerts the concerned departments and transmits the anomalous frames to cloud analysis center for their detailed evaluation in the second phase. The cloud analysis center resorts to the proposed two-stream network, modeled from the integration of spatiotemporal and optical flow features through the sequential frames. Fused features flow through a bi-directional long short-term memory (BD-LSTM) layer, which classifies them into their respective anomaly classes, e.g., assault and abuse. We perform extensive experiments over benchmarks built on top of the UCF-Crime and RWF-2000 datasets to test the effectiveness of our framework. We report a 9.88% and 4.01% increase in accuracy when compared to state-of-the-art methods evaluated over the aforementioned datasets. (c) 2021 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available