A reinforcement learning approach to parameter selection for distributed optimal power flow
Abstract
With the increasing penetration of distributed energy resources, distributed optimization algorithms have attracted significant attention for power systems applications due to their potential for superior scalability, privacy, and robustness to a single point-of-failure. The Alternating Direction Method of Multipliers (ADMM) is a popular distributed optimization algorithm; however, its convergence performance is highly dependent on the selection of penalty parameters, which are usually chosen heuristically. In this work, we use reinforcement learning (RL) to develop an adaptive penalty parameter selection policy for alternating current optimal power flow (ACOPF) problem solved via ADMM with the goal of minimizing the number of iterations until convergence. We train our RL policy using deep Q-learning and show that this policy can result in significantly accelerated convergence (up to a 59% reduction in the number of iterations compared to existing, curvature-informed penalty parameter selection methods). Furthermore, we show that our RL policy demonstrates promise for generalizability, performing well under unseen loading schemes as well as under unseen losses of lines and generators (up to a 50% reduction in iterations). This work thus provides a proof-of-concept for using RL for parameter selection in ADMM for power systems applications.
- Publication:
-
Electric Power Systems Research
- Pub Date:
- November 2022
- DOI:
- 10.1016/j.epsr.2022.108546
- arXiv:
- arXiv:2110.11991
- Bibcode:
- 2022EPSR..21208546Z
- Keywords:
-
- Alternating direction method of multipliers;
- Alternating current optimal power flow;
- Distributed optimization;
- Reinforcement learning;
- Deep Q-learning;
- Electrical Engineering and Systems Science - Systems and Control;
- Computer Science - Machine Learning