4.6 Article

Data Integrity Attack in Dynamic State Estimation of Smart Grid: Attack Model and Countermeasures

Journal

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TASE.2022.3149764

Keywords

Data integrity; Smart grids; State estimation; Power system dynamics; Feature extraction; Data models; Markov processes; Cyber-physical system; smart grid; data integrity attack; attack detection; deep reinforcement learning

Funding

  1. National Natural Science Foundation of China [62173268, 61803295, 61973247, 61673315]
  2. Major Research Plan of the National Natural Science Foundation of China [61833015]
  3. National Postdoctoral Innovative Talents Support Program of China [BX20200272]
  4. National Key Research and Development Program of China [2019YFB1704103]
  5. China Postdoctoral Science Foundation [2018M643659]

Ask authors/readers for more resources

A deep reinforcement learning-based approach is proposed in this paper to detect data integrity attacks in a smart grid. The approach utilizes the state features of previous time steps to determine whether the system is under attack, and employs noisy networks and prioritized experience replay to improve detection performance.
A smart grid integrates advanced sensors, efficient measurement methods, progressive control technologies, and other techniques and devices to achieve safe, efficient and economical operation of the grid system. However, the diversified and open environment of a smart grid makes energy and information of the smart grid vulnerable to malicious attacks. As a representative cyber-physical attack, the data integrity attack has an extremely severe impact on the grid operation for it can bypass the traditional detection mechanisms by adjusting the attack vector. In this paper, we first present the attack strategy against dynamic state estimation of power grid in the perspective of adversary and formulate the data integrity attack detection problem that has the characteristic of sequential decision making as a partially observable Markov decision process. Then, a deep reinforcement learning-based approach is proposed to detect against data integrity attacks, which utilizes the Long Short-Term Memory layer to extract the state features of previous time steps in determining whether the system is currently under attack. Moreover, the noisy networks are employed to ensure effective agent exploration, which prevents the agent from sticking to the non-optimal policy. The principle of a multi-step learning is adopted to increase the estimation accuracy of Q value. To address the sparse rewards problem, the prioritized experience replay is proposed to increase training efficiency. Simulation results demonstrated that the proposed detection approach surpasses the benchmarks in the comparison metrics: delay error rate and false rate.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available