4.7 Article

A Hybrid Data-Driven Method for Fast Solution of Security-Constrained Optimal Power Flow

Journal

IEEE TRANSACTIONS ON POWER SYSTEMS
Volume 37, Issue 6, Pages 4365-4374

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TPWRS.2022.3150023

Keywords

Costs; Load flow; Training; Reinforcement learning; Mathematical models; Security; Supervised learning; Security-constrained optimal power flow; model-assisted; deep reinforcement learning; primal-dual deep deterministic policy gradient

Funding

  1. Ministry of Education (MOE), Republic of Singapore [AcRF TIER 1 2019-T1-001-069, RG75/19, TPWRS-00702-2021]

Ask authors/readers for more resources

This paper proposes a hybrid data-driven method for fast solutions of preventive security-constrained optimal power flow (SCOPF) of power systems. The well-trained DRL agent can rapidly obtain high-quality SCOPF solutions that satisfy the security constraints by approximating actor gradients through solving KKT conditions.
This paper proposes a hybrid data-driven method for fast solutions of preventive security-constrained optimal power flow (SCOPF) of power systems. The proposed method formulates the SCOPF problem as constraints-satisfying training of a deep reinforcement learning (DRL) agent, where the action-value function of DRL is augmented by contingency security constraints. In the training process, the proposed method hybridizes the primal-dual deep deterministic policy gradient (PD-DDPG) and the classic SCOPF model. Instead of building reward critic networks and cost critic networks via interacting with the environment (i.e., power flow equations), the actor gradients are approximated by solving KKT conditions of the Lagrangian. Finally, with the formulated sparse Jacobians of constraints and sparse Hessians of Lagrangians, the interior point method is incorporated in PD-DDPG to derive the parameters updating rule of the DRL agent. Numerical tests are carried out on a modified IEEE 57-bus system and a modified IEEE 300-bus system for critical contingencies. The results show that the well-trained DRL agent can rapidly (real-time) obtain high-quality SCOPF solutions that satisfy the security constraints.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available