4.7 Article

Deep Q-network-based heuristic intrusion detection against edge-based SIoT zero-day attacks

Journal

APPLIED SOFT COMPUTING
Volume 150, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.asoc.2023.111080

Keywords

Social Internet of Things; Zero-day attacks; Intrusion detection system; Deep Q-Networks; Heuristic learning

Ask authors/readers for more resources

To address the damage caused by zero-day attacks on SIoT systems, researchers propose a heuristic learning intrusion detection system named DQN-HIDS. By integrating Deep Q-Networks (DQN) into the system, DQN-HIDS gradually improves its ability to identify malicious traffic and reduces resource workloads. Experiments demonstrate the superior performance of DQN-HIDS in terms of workload, delayed sample queue, rewards, and classifier accuracy.
How to process and classify zero-day attacks due to their huge damage to social Internet of Things (SIoT) systems has become a hot research issue. To solve this issue, we propose a heuristic learning intrusion detection system with Deep Q-Networks (DQN) for edge-based SIoT networks under the scenario of insufficient training samples, which is named DQN-HIDS. It is composed of an SIoT network traffic processing module and a DQN-based heuristic learning network. The SIoT network traffic processing module generates SIoT traffic samples, selects samples entering a classifier and a cybersecurity examiner center, and outputs similarity. We integrate DQN into a heuristic learning network to gradually improve its ability to identify malicious traffic. Specially, reward functions are designed according to the selected actions of the network, in order to punish the behavior of incorrectly labeling malicious samples and make variable reward functions adapt to different execution actions. The LSTM-based DQN then maximizes the cumulative expected reward to find the optimal strategy for the heuristic learning network. Consequently, DQN-HIDS gradually improves the behavior frequency of its labeling, reduces resource workloads, and increases the ability to label SIoT network traffic. Experiments show the performance of DQN-HIDS in terms of the workload of the examiner center and the queue workload of delayed samples, the rewards obtained by the DQN-based heuristic learning network, and the accuracy of the classifier. Comparisons with a state-of-the-art deep learning model and typical machine learning methods are also made, demonstrating the advantages of DQN-HIDS with fewer SIoT network traffic samples.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available