Safe Exploration Reinforcement Learning for Load Restoration using Invalid Action Masking

February 15, 2024

Conference Paper

Abstract

This paper addresses the load restoration problem after a power outage event. The proposed methodology uses multi-agent reinforcement learning to make optimal sequential decisions on picking up critical loads. Typically, a negative reward is used to discourage the agents from selecting decisions that violate physical constraints during the restoration process. However, the main disadvantage of this penalty-based approach is that it is difficult to apply to large-scale systems because of the curse of dimensionality. This paper introduces the invalid action masking technique to overcome this limitation. The features of this technique include zero physical constraint violations, reduced training time, and a stabilized exploration process. Simulations are performed on the IEEE 13-node and IEEE 123-node systems, and the results show that the proposed algorithm outperforms the conventional approaches in terms of both restored power and learning curve.
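The masking idea named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the single-capacity feasibility rule, and all numbers below are assumptions chosen for illustration only. Actions that would violate a constraint have their logits set to negative infinity before the softmax, so the policy assigns them exactly zero probability and the agent can never sample them.

```python
import numpy as np


def feasible_pickups(load_kw, energized, available_kw):
    """Hypothetical feasibility rule: a load may be picked up only if it is
    not already energized and fits within the remaining capacity."""
    load_kw = np.asarray(load_kw, dtype=float)
    energized = np.asarray(energized, dtype=bool)
    return (~energized) & (load_kw <= available_kw)


def masked_policy(logits, valid_mask):
    """Softmax over logits with invalid actions forced to probability zero."""
    logits = np.asarray(logits, dtype=float)
    valid_mask = np.asarray(valid_mask, dtype=bool)
    if not valid_mask.any():
        raise ValueError("at least one action must be valid")
    # Invalid actions get -inf, so exp(-inf) == 0 after the softmax.
    masked = np.where(valid_mask, logits, -np.inf)
    # Subtract the largest valid logit for numerical stability.
    shifted = masked - masked[valid_mask].max()
    exp = np.exp(shifted)
    return exp / exp.sum()


# Illustrative numbers only (not taken from the paper).
load_kw = [120.0, 400.0, 80.0, 250.0]
energized = [False, False, True, False]
mask = feasible_pickups(load_kw, energized, available_kw=300.0)
probs = masked_policy(logits=[0.2, 1.5, -0.3, 0.7], valid_mask=mask)

rng = np.random.default_rng(0)
action = int(rng.choice(len(probs), p=probs))  # always a feasible load index
```

In contrast with a negative-reward penalty, the agent here does not have to learn from experience that an action is infeasible, which is consistent with the abstract's claims of zero constraint violations and a more stable exploration process.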

Published: February 15, 2024

Citation

Vu L., T. Vu, T. Vu, and S. Anurag. 2023. Safe Exploration Reinforcement Learning for Load Restoration using Invalid Action Masking. In IEEE Power & Energy Society General Meeting (PESGM 2023), July 16-20, 2023, Orlando, FL, 1-5. Piscataway, New Jersey: IEEE. PNNL-SA-179868. doi:10.1109/PESGM52003.2023.10253213

Research topics

Electric Grid Modernization

