4.5 Article

Twin-Delayed Deep Deterministic Policy Gradient for Low-Frequency Oscillation Damping Control

Journal

ENERGIES
Volume 14, Issue 20, Pages -

Publisher

MDPI
DOI: 10.3390/en14206695

Keywords

latency; twin-delayed deep deterministic policy gradient; damping control; wide-area measurement systems; low-frequency oscillations

Categories

Funding

  1. Collaborative Research of Learning and Optimizing Power Systems: A Geometric Approach [1810537]
  2. CAREER award of Faithful, Reducible, and Invertible Learning in Distribution System for Power Flow [2048288]

Ask authors/readers for more resources

This study aims to address damping control issues under unknown latency in power networks by redesigning the learning structure and developing a new method. Proposed novel reward algorithm optimizes system stability and reliability by considering various factors.
Due to the large scale of power systems, latency uncertainty in communications can cause severe problems in wide-area measurement systems. To resolve this issue, a significant amount of past work focuses on using emerging technology, including machine learning methods such as Q-learning, for addressing latency issues in modern controls. Although the method can deal with the stochastic characteristics of communication latency, the Q-values can be overestimated in Q-learning methods, leading to high bias. To address the overestimation bias issue, we redesign the learning structure of the deep deterministic policy gradient (DDPG). Then we develop a damping control twin-delayed deep deterministic policy gradient method to handle the damping control issue under unknown latency in the power network. The purpose is to address the damping control issue under unknown latency in the power network. This paper will create a novel reward algorithm, taking into account the machine speed deviation, the episode termination prevention, and the feedback from action space. In this way, the system optimally damps down frequency oscillations while maintaining the system's stability and reliable operation within defined limits. The simulation results verify the proposed algorithm in various perspectives, including the latency sensitivity analysis under high renewable energy penetration and the comparison with conventional and machine learning control algorithms. The proposed method shows a fast learning curve and good control performance under varying communication latency.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available