4.8 Article

Deep Reinforcement Learning Aided Variable-Frequency Triple-Phase-Shift Control for Dual-Active-Bridge Converter

期刊

IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS
卷 70, 期 10, 页码 10506-10515

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TIE.2022.3220893

关键词

Control systems; Switching frequency; Phase modulation; Inductors; Bridge circuits; Zero voltage switching; Harmonic analysis; Deep reinforcement learning (DRL); dual active bridge (DAB); triple phase shift (TPS); twin delayed deep deterministic policy gradient (TD3); varying switching frequency

向作者/读者索取更多资源

This article proposes an improvement approach for the dual-active-bridge converter using a variable-frequency triple-phase-shift control strategy with the help of deep reinforcement learning. The twin delayed deep deterministic policy gradient algorithm is adopted to train the agent offline, aiming to minimize power losses under varying switching frequencies. The training of the algorithm incorporates zero-voltage switching performance. Based on this, the trained agent acts as a fast surrogate predictor, generating appropriate control strategies in real-time for continuous operation with soft switching and maximum conversion efficiency. The effectiveness and correctness of the proposed scheme are validated through experimental results in a laboratory prototype.
To improve the conversion efficiency of the dual-active-bridge converter, this article demonstrates a variable-frequency triple-phase-shift (TPS) control strategy with the help of the deep reinforcement learning method. More specifically, the twin delayed deep deterministic policy gradient (TD3) algorithm is adopted to train the agent offline with the aim of minimum power losses, under the TPS modulation with varying switching frequency. Moreover, the zero-voltage-switching performance has been considered during the training of the TD3 algorithm. Based on these, the trained TD3 agent acts as a fast surrogate predictor, which can produce appropriate control strategies in real-time for whole continuous operating conditions with soft switching and maximum conversion efficiency. The effectiveness and correctness of the proposed scheme is validated through the experimental results in a laboratory prototype.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据