基于RLDE算法的梯级水库发电优化调度方法

doi:10.11988/ckyyb.20240431

raybet体育在线院报 ›› 2025, Vol. 42 ›› Issue (6): 210-218.DOI: 10.11988/ckyyb.20240431

• 水库群多目标优化调度研究专栏 • 上一篇

基于RLDE算法的梯级水库发电优化调度方法

陈佳雯¹^,²(), 祝欣¹^,², 汤正阳³, 沈柯言³, 陈晓淋¹^,², 覃晖¹^,²()

¹ 华中科技大学土木与水利工程学院,武汉 430074
² 华中科技大学数字流域科学与技术湖北省重点实验室,武汉 430074
³ 三峡水利枢纽梯级调度通信中心,湖北宜昌 443000

收稿日期:2024-04-29 修回日期:2024-07-03 出版日期:2025-06-16 发布日期:2025-06-16
通信作者:
覃晖(1983-),男,湖北宜城人,教授,博士,研究方向为水库群多目标优化调度。E-mail: hqin@hust.edu.cn
作者简介:
陈佳雯(1998-),女,福建龙岩人,硕士研究生,研究方向为水库群优化调度。E-mail: m202271582@hust.edu.cn
基金资助:
国家重点研发计划项目(2021YFC3200303); 水利部重大科技项目(SKS-2022120); 湖北省自然科学基金联合基金重点项目(2022CFD027); 中国长江电力股份有限公司资助项目(Z242302044)

Optimal Scheduling Method for Power Generation of Cascade Reservoirs Based on RLDE Algorithm

CHEN Jia-wen¹^,²(), ZHU Xin¹^,², TANG Zheng-yang³, SHEN Ke-yan³, CHEN Xiao-lin¹^,², QIN Hui¹^,²()

¹ School of Civil and Hydraulic Engineering, Huazhong University of Science and Technology, Wuhan 430074,China
² Hubei Key Laboratory of Digital Valley Science and Technology, Huazhong University of Science andTechnology, Wuhan 430074, China
³ Three Gorges Cascade Dispatch and Communication Center,China Yangtze Power Co., Ltd., Yichang 443000, China

Received:2024-04-29 Revised:2024-07-03 Published:2025-06-16 Online:2025-06-16

摘要/Abstract

摘要：

梯级水库群联合运用可以充分发挥流域综合利用价值,但同时梯级水库群优化调度是不易求解复杂的系统性问题,差分进化(DE)算法是一种基于群体差异的启发式并行搜索方法,具有非常优秀的寻优能力,常被应用于水库优化调度模型的求解,但传统DE算法的参数设定及进化策略常由经验确定易出现早熟收敛或搜索停滞等现象。针对DE算法常见问题,提出了耦合强化学习与差分进化的智能算法(RLDE),该算法采用混沌映射提高初始解质量,并通过Q-learning算法实现自适应参数调整从而增加个体多样性,避免早熟收敛问题,同时由于Q-learning算法不断与环境交互反馈的机制,很大程度上降低了搜索停滞的风险。金沙江下游流域实践结果表明:RLDE算法相较于DE算法及自适应遗传算法(AGA)具有优秀的全局寻优能力及鲁棒性,能够有效求解梯级水库群发电优化调度模型,具有一定的工程实际应用价值。

关键词: 梯级水库群, 优化调度, 差分进化, 强化学习, 自适应调参

Abstract:

[Objective] To address the shortcomings of differential evolution (DE) algorithms in cascade reservoir optimization, this study proposes an intelligent algorithm that couples reinforcement learning and differential evolution (RLDE). [Methods] The RLDE algorithm improved the standard DE algorithm through three key strategies: chaotic mapping to enhance initial solution quality, Q-learning-based adaptive parameter adjustment, and a variable step-size strategy. Specifically, (1) chaotic mapping enhanced the initial solution quality. Logistic mapping with the best experimental performance was selected and applied to the population initialization of the RLDE algorithm. (2) The adaptive parameter adjustment was conducted based on the Q-learning algorithm. (3) A variable step-size strategy was designed for the actions in the Q-table, where the precision of action rows gradually increased with the number of iterations. To validate the feasibility and effectiveness of the RLDE algorithm, it was applied to optimize the power generation scheduling model for four major cascade reservoirs (Wudongde, Baihetan, Xiluodu, and Xiangjiaba) on the lower Jinsha River. [Results] (1) The chaotic initialization strategy effectively improved the initial solution quality. The adaptive parameter adjustment strategy based on the Q-learning algorithm enabled the algorithm to continuously adapt by receiving feedback from the environment. This process enhanced population diversity, greatly mitigated problems such as premature convergence or population evolutionary stagnation found in the traditional DE algorithm, thereby improving optimization performance. The variable step-size strategy allowed the algorithm to better respond to environmental feedback, further strengthening the optimization capability of the algorithm. (2) Compared with the traditional DE algorithm and adaptive genetic algorithm, the RLDE algorithm achieved an average annual power generation increase of 2.02% and 2.06%, respectively, under three typical inflow scenarios (wet, normal, and dry). Moreover, the average standard deviation of the proposed algorithm after multiple runs was reduced by an average of 729 million kW·h compared with the traditional DE algorithm, and by 844 million kW·h compared with the adaptive genetic algorithm. [Conclusions] This study proposes an intelligent algorithm that integrates reinforcement learning with differential evolution, effectively addressing issues such as premature convergence and search stagnation in the traditional DE algorithm. The proposed method provides an efficient and reliable solution for the optimal scheduling of cascade reservoirs.

Key words: cascade reservoirs, optimal scheduling, differential evolution, reinforcement learning, adaptive parameter adjustment

中图分类号:

TV697.1+2

陈佳雯, 祝欣, 汤正阳, 沈柯言, 陈晓淋, 覃晖. 基于RLDE算法的梯级水库发电优化调度方法[J]. raybet体育在线院报, 2025, 42(6): 210-218.

CHEN Jia-wen, ZHU Xin, TANG Zheng-yang, SHEN Ke-yan, CHEN Xiao-lin, QIN Hui. Optimal Scheduling Method for Power Generation of Cascade Reservoirs Based on RLDE Algorithm[J]. Journal of Changjiang River Scientific Research Institute, 2025, 42(6): 210-218.

电站名称	调节性能	调节库容/ (亿m³)	控制面积/ (万km²)	死水位/ m	正常蓄水位/m	汛限水位/ m	装机容量/ (万kW)
乌东德	季调节	30.20	40.61	945	975	952	1 020
白鹤滩	年调节	104.36	43.03	765	825	785	1 600
溪洛渡	不完全年调节	64.32	45.44	540	600	560	1 386
向家坝	不完全季调节	9.03	45.88	370	380	370	640

电站名称	调节性能	调节库容/ (亿m³)	控制面积/ (万km²)	死水位/ m	正常蓄水位/m	汛限水位/ m	装机容量/ (万kW)
乌东德	季调节	30.20	40.61	945	975	952	1 020
白鹤滩	年调节	104.36	43.03	765	825	785	1 600
溪洛渡	不完全年调节	64.32	45.44	540	600	560	1 386
向家坝	不完全季调节	9.03	45.88	370	380	370	640

缩放因子F	平均发电量/ (亿kW·h)	最优发电量/ (亿kW·h)	标准差/ (亿kW·h)
0.8	2 150.35	2 158.06	6.05
0.6	2 132.22	2 149.95	10.77
0.4	2 067.76	2 100.91	23.20
0.2	2 020.30	2 045.15	22.77

缩放因子F	平均发电量/ (亿kW·h)	最优发电量/ (亿kW·h)	标准差/ (亿kW·h)
0.8	2 150.35	2 158.06	6.05
0.6	2 132.22	2 149.95	10.77
0.4	2 067.76	2 100.91	23.20
0.2	2 020.30	2 045.15	22.77

来水情况	算法名称	平均值/ (亿kW·h)	绝对提升量/ (亿kW·h)	相对提升量/%
	DE	2 150.35	39.65	1.84
平水年	AGA	2 146.03	43.97	2.05
	RLDE	2 190.00
	DE	2 300.89	40.37	1.75
丰水年	AGA	2 302.12	39.14	1.70
	RLDE	2 341.26
	DE	1 986.54	49.03	2.47
枯水年	AGA	1 987.22	48.34	2.43
	RLDE	2 035.56

基于RLDE算法的梯级水库发电优化调度方法

Optimal Scheduling Method for Power Generation of Cascade Reservoirs Based on RLDE Algorithm

RichHTML

PDF

可视化

摘要/Abstract

引用本文

使用本文

图/表 9

相关文章 15

编辑推荐

Metrics

本文评价

来水情况	算法名称	最优值	平均值	最差值	极差	标准差	平均执行时长/s
	DE	2 158.06	2 150.35	2 139.00	19.07	6.05	2.20
平水年	AGA	2 161.90	2 146.03	2 133.64	28.25	9.50	3.10
	RLDE	2 190.62	2 190.00	2 189.01	1.62	0.54	3.90
	DE	2 313.85	2 300.89	2 282.26	31.59	10.09	2.00
丰水年	AGA	2 312.11	2 302.12	2 290.00	22.10	8.41	3.00
	RLDE	2 343.21	2 341.26	2 339.93	3.28	1.00	3.70
枯水年	DE	2 001.72	1 986.54	1 970.13	31.59	8.33	2.50
	AGA	1 997.98	1 987.22	1 970.38	27.60	10.00	3.00
	RLDE	2 036.78	2 035.56	2 033.53	3.25	1.07	4.00

方案	不同进化代数梯级发电量/(亿kW·h)								方法
方案	1	100	200	300	400	450	480	500	方法
1	有破坏	2 165.70	2 167.80	2 167.86	2 167.86	2 167.86	2 167.86	2 167.86	DE
2	1 936.83	2 122.12	2 149.32	2 160.41	2 164.98	2 165.05	2 168.33	2 168.33	AGA
3	1 928.94	2 168.51	2 191.21	2 195.72	2 199.34	2 199.96	2 200.13	2 200.29	简单RLDE
4	1 928.94	2 168.51	2 191.21	2 195.72	2 199.34	2 200.01	2 200.20	2 200.33	RLDE+步长策略
5	1 963.49	2 155.80	2 192.36	2 198.30	2 200.34	2 200.61	2 200.70	2 200.74	RLDE +步长策略+ 初始化策略

[1]	李泽宏, 袁肖峰, 肖鹏, 张太衡, 覃晖. 基于多目标飞蛾扑火算法的水光互补系统优化调度[J]. raybet体育在线院报, 2025, 42(6): 203-209.
[2]	方国华, 刘畅, 丁紫玉. 基于备用流量的水风光多能互补优化调度[J]. raybet体育在线院报, 2025, 42(4): 36-44.
[3]	陈进. 长江梯级水库群联合调度成效、挑战及对策[J]. raybet体育在线院报, 2024, 41(5): 1-7.
[4]	徐杨, 吕昊, 刘帅, 方威, 覃晖. Kriging水动力学代理模型在水库群优化调度中的应用[J]. raybet体育在线院报, 2024, 41(2): 7-13.
[5]	何耀耀, 胡千帝, 张召. 基于自适应混沌精英变异差分进化算法的中长期水资源优化调度[J]. raybet体育在线院报, 2024, 41(10): 14-22.
[6]	陈进. 严重干旱情景下长江水资源调控与管理[J]. raybet体育在线院报, 2023, 40(3): 1-5.
[7]	汪涛, 徐杨, 刘亚新, 卢佳, 马皓宇. 基于多种群引力粒子群算法的金沙江下游—三峡梯级水库群优化调度[J]. raybet体育在线院报, 2023, 40(12): 30-36,58.
[8]	欧阳硕, 徐长江, 邵骏, 胡丰渝. 干旱条件下长江上游梯级水库群蓄水形势初探[J]. raybet体育在线院报, 2023, 40(12): 15-22.
[9]	张海荣, 姚华明, 汤正阳, 吴碧琼, 信天旗. 雅砻江和金沙江中下游梯级水库联合优化调度建模及应用Ⅱ——联合优化调度规则分析[J]. raybet体育在线院报, 2022, 39(9): 38-42.
[10]	张海荣, 姚华明, 鲍正风, 汤正阳, 华小军, 张东杰. 雅砻江和金沙江中下游梯级水库联合优化调度建模及应用Ⅰ——联合优化调度潜力分析[J]. raybet体育在线院报, 2022, 39(9): 30-37.
[11]	孙桂凯, 石锐, 刘思怡, 王国帅, 赵荣娜, 莫崇勋. 基于长期与中长期嵌套的水库优化调度[J]. raybet体育在线院报, 2022, 39(8): 23-28.
[12]	蒋建灵. 杭州江北主城区排涝格局优化研究[J]. raybet体育在线院报, 2021, 38(7): 42-45.
[13]	周婷, 戚王月, 金菊良. 水库群优化调度中的结构分析方法研究进展[J]. raybet体育在线院报, 2020, 37(12): 14-21.
[14]	钟文杰, 陈璐, 周建中, 仇红亚, 黄康迪. 考虑随机来水的水电站中长期发电调度多重风险分析[J]. raybet体育在线院报, 2020, 37(10): 37-44.
[15]	李荣波,纪昌明,孙平,刘丹,张璞,李继清. 基于改进混合蛙跳算法的梯级水库优化调度[J]. raybet体育在线院报, 2018, 35(6): 30-35.