4.8 Article

Autonomous Input Voltage Sharing Control and Triple Phase Shift Modulation Method for ISOP-DAB Converter in DC Microgrid: A Multiagent Deep Reinforcement Learning-Based Method

Journal

IEEE TRANSACTIONS ON POWER ELECTRONICS
Volume 38, Issue 3, Pages 2985-3000

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TPEL.2022.3218900

Keywords

Microgrids; Voltage control; Stress; Uncertainty; Minimization; Inductors; Training; Input-series output-parallel-connected dual active bridge (ISOP-DAB) converter; input voltage sharing (IVS); multiagent twin-delayed deep deterministic policy gradient (MA-TD3); triple phase shift modulation

Ask authors/readers for more resources

This article proposes a multiagent deep reinforcement learning (DRL) based approach for controlling input voltage sharing and modulation in an input-series output-parallel dual active bridge converter. The proposed method addresses the challenges of uncertainties in the dc microgrid, power balance, and current stress minimization. The control and modulation problem is formulated as a Markov game and solved using the MA twin-delayed deep deterministic policy gradient (MA-TD3) algorithm. Simulation and experimental results demonstrate the effectiveness of the proposed approach.
This article proposes a multiagent (MA) deep reinforcement learning (DRL) based autonomous input voltage sharing (IVS) control and triple phase shift modulation method for input-series output-parallel (ISOP) dual active bridge (DAB) converters to solve the three challenges: the uncertainties of the dc microgrid, the power balance problem, and the current stress minimization of the converter. Specifically, the control and modulation problem of the ISOP-DAB converter is formed as a Markov game with several DRL agents. Subsequently, the MA twin-delayed deep deterministic policy gradient (MA-TD3) algorithm is applied to train the DRL agents in an offline manner. After the training process, the multiple agents can provide online control decisions for the ISOP-DAB converter to balance the IVS, and minimize the current stress among different submodules. Without accurate model information, the proposed method can adaptively obtain the optimal modulation variable combinations in a stochastic and uncertain environment. Simulation and experimental results verify the effectiveness of the proposed MA-TD3-based algorithm.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available