4.6 Article

Sensors Integrated Control of PEMFC Gas Supply System Based on Large-Scale Deep Reinforcement Learning

期刊

SENSORS
卷 21, 期 2, 页码 -

出版社

MDPI
DOI: 10.3390/s21020349

关键词

distributed deep reinforcement learning; edge-cloud collaborative multiple tricks distributed deep deterministic policy gradient; PEMFC; integrated control of gas supply system

资金

  1. National Natural Science Foundation of China [51777078]

向作者/读者索取更多资源

A integrated controller of the PEMFC gas supply system based on DDRL was proposed to address the coordination issue between airflow and hydrogen flow. The ECMTD-DDPG algorithm introduced an edge exploration policy, improving distributed exploration in the environment and achieving better control performance.
In the proton exchange membrane fuel cell (PEMFC) system, the flow of air and hydrogen is the main factor influencing the output characteristics of PEMFC, and there is a coordination problem between their flow controls. Thus, the integrated controller of the PEMFC gas supply system based on distributed deep reinforcement learning (DDRL) is proposed to solve this problem, it combines the original airflow controller and hydrogen flow controller into one. Besides, edge-cloud collaborative multiple tricks distributed deep deterministic policy gradient (ECMTD-DDPG) algorithm is presented. In this algorithm, an edge exploration policy is adopted, suggesting that the edge explores including DDPG, soft actor-critic (SAC), and conventional control algorithm are employed to realize distributed exploration in the environment, and a classified experience replay mechanism is introduced to improve exploration efficiency. Moreover, various tricks are combined with the cloud centralized training policy to address the overestimation of Q-value in DDPG. Ultimately, a model-free integrated controller of the PEMFC gas supply system with better global searching ability and training efficiency is obtained. The simulation verifies that the controller enables the flows of air and hydrogen to respond more rapidly to the changing load.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据