4.8 Article

Deep Anomaly Detection for Time-Series Data in Industrial IoT: A Communication-Efficient On-Device Federated Learning Approach

Journal

IEEE INTERNET OF THINGS JOURNAL
Volume 8, Issue 8, Pages 6348-6358

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2020.3011726

Keywords

Deep anomaly detection (DAD); federated learning (FL); gradient compression; Industrial Internet of Things

Funding

  1. Alibaba Group through the Alibaba Innovative Research (AIR) Program
  2. AlibabaNTU Singapore Joint Research Institute (JRI), NTU, Singapore
  3. State Key Laboratory of Industrial Control Technology, Zhejiang University, China [ICT20044]
  4. National Natural Science Foundation of China [51806157]
  5. Young Innovation Talents Project in Higher Education of Guangdong Province, China [2018KQNCX333]
  6. King Saud University, Riyadh, Saudi Arabia [RSP-2020/32]

Ask authors/readers for more resources

This article proposes a communication-efficient on-device federated learning (FL)-based deep anomaly detection framework for sensing time-series data in IIoT. The framework includes an FL framework, AMCNN-LSTM model, and gradient compression mechanism, which can improve generalization ability, accurately detect anomalies, and enhance communication efficiency.
Since edge device failures (i.e., anomalies) seriously affect the production of industrial products in Industrial IoT (IIoT), accurately and timely detecting anomalies are becoming increasingly important. Furthermore, data collected by the edge device contain massive user's private data, which is challenging current detection approaches as user privacy has attracted more and more public concerns. With this focus, this article proposes a new communication-efficient on-device federated learning (FL)-based deep anomaly detection framework for sensing time-series data in IIoT. Specifically, we first introduce an FL framework to enable decentralized edge devices to collaboratively train an anomaly detection model, which can improve its generalization ability. Second, we propose an attention mechanism-based convolutional neural network-long short-term memory (AMCNN-LSTM) model to accurately detect anomalies. The AMCNN-LSTM model uses attention mechanism-based convolutional neural network units to capture important fine-grained features, thereby preventing memory loss and gradient dispersion problems. Furthermore, this model retains the advantages of the long short-term memory unit in predicting time-series data. Third, to adapt the proposed framework to the timeliness of industrial anomaly detection, we propose a gradient compression mechanism based on Top-k selection to improve communication efficiency. Extensive experimental studies on four real-world data sets demonstrate that our framework accurately and timely detects anomalies and also reduces the communication overhead by 50% compared to the FL framework that does not use the gradient compression scheme.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available