Journal of Changjiang River Scientific Research Institute ›› 2025, Vol. 42 ›› Issue (6): 210-218.DOI: 10.11988/ckyyb.20240431

• Multi-Objective Optimization Scheduling for Reservoir Groups • Previous Articles    

Optimal Scheduling Method for Power Generation of Cascade Reservoirs Based on RLDE Algorithm

CHEN Jia-wen1,2(), ZHU Xin1,2, TANG Zheng-yang3, SHEN Ke-yan3, CHEN Xiao-lin1,2, QIN Hui1,2()   

  1. 1 School of Civil and Hydraulic Engineering, Huazhong University of Science and Technology, Wuhan 430074,China
    2 Hubei Key Laboratory of Digital Valley Science and Technology, Huazhong University of Science andTechnology, Wuhan 430074, China
    3 Three Gorges Cascade Dispatch and Communication Center,China Yangtze Power Co., Ltd., Yichang 443000, China
  • Received:2024-04-29 Revised:2024-07-03 Published:2025-06-16 Online:2025-06-16
  • Contact: QIN Hui

Abstract:

[Objective] To address the shortcomings of differential evolution (DE) algorithms in cascade reservoir optimization, this study proposes an intelligent algorithm that couples reinforcement learning and differential evolution (RLDE). [Methods] The RLDE algorithm improved the standard DE algorithm through three key strategies: chaotic mapping to enhance initial solution quality, Q-learning-based adaptive parameter adjustment, and a variable step-size strategy. Specifically, (1) chaotic mapping enhanced the initial solution quality. Logistic mapping with the best experimental performance was selected and applied to the population initialization of the RLDE algorithm. (2) The adaptive parameter adjustment was conducted based on the Q-learning algorithm. (3) A variable step-size strategy was designed for the actions in the Q-table, where the precision of action rows gradually increased with the number of iterations. To validate the feasibility and effectiveness of the RLDE algorithm, it was applied to optimize the power generation scheduling model for four major cascade reservoirs (Wudongde, Baihetan, Xiluodu, and Xiangjiaba) on the lower Jinsha River. [Results] (1) The chaotic initialization strategy effectively improved the initial solution quality. The adaptive parameter adjustment strategy based on the Q-learning algorithm enabled the algorithm to continuously adapt by receiving feedback from the environment. This process enhanced population diversity, greatly mitigated problems such as premature convergence or population evolutionary stagnation found in the traditional DE algorithm, thereby improving optimization performance. The variable step-size strategy allowed the algorithm to better respond to environmental feedback, further strengthening the optimization capability of the algorithm. (2) Compared with the traditional DE algorithm and adaptive genetic algorithm, the RLDE algorithm achieved an average annual power generation increase of 2.02% and 2.06%, respectively, under three typical inflow scenarios (wet, normal, and dry). Moreover, the average standard deviation of the proposed algorithm after multiple runs was reduced by an average of 729 million kW·h compared with the traditional DE algorithm, and by 844 million kW·h compared with the adaptive genetic algorithm. [Conclusions] This study proposes an intelligent algorithm that integrates reinforcement learning with differential evolution, effectively addressing issues such as premature convergence and search stagnation in the traditional DE algorithm. The proposed method provides an efficient and reliable solution for the optimal scheduling of cascade reservoirs.

Key words: cascade reservoirs, optimal scheduling, differential evolution, reinforcement learning, adaptive parameter adjustment

CLC Number: 

Baidu
map