4.7 Article

Deep Reinforcement Learning Control for Pulsed Power Load Online Deployment in DC Shipboard Integrated Power System

Journal

IEEE TRANSACTIONS ON POWER SYSTEMS
Volume 38, Issue 4, Pages 3557-3567

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TPWRS.2022.3201370

Keywords

DC shipboard integrated power system; deep reinforcement learning; optimal control; pulsed power load

Ask authors/readers for more resources

In this paper, a deep reinforcement learning (DRL) optimal control method is proposed to address the online deployment problem of pulsed power load (PPL) in DC shipboard integrated power systems (SIPSs). The method adopts a stack-based state observation technique to enhance learning and control performance, and employs a multi-objective reward function design. The performance of the proposed DRL control is validated by case studies considering different load conditions.
Online deployment of pulsed power load (PPL) is one of the most challenging issues in DC shipboard integrated power systems (SIPSs), which leads to a multi-objective optimal control problem subject to various constraints in this paper. Since traditional model-based methods face difficulties in designing the optimal control policy and are prone to model inaccuracy and parameter uncertainty, there is an urgent need for a model-free and also high-performance control approach. Thus, a deep reinforcement learning (DRL) optimal control, which employs the twin-delayed deep deterministic policy gradient (TD3) algorithm, is presented in this paper. The DRL optimal control adopts a stack-based state observation technique to enhance learning and control performance, and it uses a multi-objective reward function design to signify the overall dynamic performance. Besides achieving the safe and fast online deployment of PPL, it also fulfills the regulation of DC bus voltage and the proportional current sharing among distributed generations (DGs). Moreover, the DRL control has an advantage in handling the ramp rate constraints of SIPS. The optimal control satisfying ramp rate constraints can be obtained through a deep learning process. The performance of the proposed DRL control is validated by case studies considering different load conditions.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available