4.7 Article

Optimal mileage-based PV array reconfiguration using swarm reinforcement learning

Journal

ENERGY CONVERSION AND MANAGEMENT
Volume 232, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.enconman.2021.113892

Keywords

PV array reconfiguration; Regulation mileage; Partial shading condition; Swarm reinforcement learning; Real-time generation scheduling

Funding

  1. National Natural Science Foundation of China [51907112, 61963020, 51777078, U2066212]
  2. Natural Science Foundation of Guangdong Province of China [2019A1515011671]
  3. Research and Development Start-Up Foundation of Shantou University [NTF19001]

Ask authors/readers for more resources

This paper proposes a new optimal mileage-based PV array reconfiguration method to improve the power output of a PV power plant and reduce additional capacity and mileage costs in a performance-based frequency regulation market. By using swarm reinforcement learning and fast interior point method, the proposed method outperforms other methods in terms of total benefit under various partial shading conditions.
This paper constructs a new optimal mileage-based PV array reconfiguration (OMAR) in a PV power plant under partial shading conditions. It aims to maximize the power output of a PV power plant, and minimize the additional capacity and mileage payments resulting from the power fluctuation in a performance-based frequency regulation market. To reduce the optimization difficulty of OMAR, it is decomposed into two optimization sub-problems, including an upper-layer discrete optimization of PV array reconfiguration and a lower-layer continuous optimization of real-time generation scheduling. The upper-layer discrete optimization is addressed by the proposed swarm reinforcement learning (SRL), which can implement an efficient exploration and exploitation with multiple cooperative agents instead of a single learning agent. The rest lower-layer optimization is handled by the fast interior point method. The proposed method's effectiveness is thoroughly evaluated on the 10 x 10 total-cross-tied PV arrays under various partial shading conditions. Simulation results demonstrate that the proposed SRL can obtain a larger total benefit than genetic algorithm (GA), particle swarm optimization (PSO), grasshopper optimization algorithm (GOA), harris hawks optimizer (HHO), butterfly optimization algorithm (BOA), and Q-learning, in which the benefit increment can reach from 2.12% (against PSO) to 10.62% (against Q-learning).

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available