4.7 Article

Data-Driven Multi-Agent Deep Reinforcement Learning for Distribution System Decentralized Voltage Control With High Penetration of PVs

Journal

IEEE TRANSACTIONS ON SMART GRID
Volume 12, Issue 5, Pages 4137-4150

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TSG.2021.3072251

Keywords

Voltage control; Training; Reactive power; Static VAr compensators; Optimization; Load modeling; Uncertainty; Voltage regulation; Gaussian process regression; network partition; multi-agent deep reinforcement learning; distribution network; PVs

Funding

  1. National Key Research and Development Program of China [2018YFE0127600, TSG-01176-2020]

Ask authors/readers for more resources

This paper proposes a novel model-free/data-driven centralized training and decentralized execution multi-agent deep reinforcement learning framework for distribution system voltage control with high penetration of PVs. The method does not rely on a system model during both the training and execution stages, achieving better scalability and significantly reducing communication requirements. By utilizing the MASAC algorithm and dividing the system into different voltage control regions with real and reactive power sensitivities, the proposed method demonstrates superiority in comparative results with other alternatives.
This paper proposes a novel model-free/data-driven centralized training and decentralized execution multi-agent deep reinforcement learning (MADRL) framework for distribution system voltage control with high penetration of PVs. The proposed MADRL can coordinate both the real and reactive power control of PVs with existing static var compensators and battery storage systems. Unlike the existing DRL-based voltage control methods, our proposed method does not rely on a system model during both the training and execution stages. This is achieved by developing a new interaction scheme between the surrogate modeling of the original system and the multi-agent soft actor critic (MASAC) MADRL algorithm. In particular, the sparse pseudo-Gaussian process with a few-shots of measurements is utilized to construct the surrogate model of the original environment, i.e., power flow model. This is a data-driven process and no model parameters are needed. Furthermore, the MASAC enabled MADRL allows to achieve better scalability by dividing the original system into different voltage control regions with the aid of real and reactive power sensitivities to voltage, where each region is treated as an agent. This also serves as the foundation for the centralized training and decentralized execution, thus significantly reducing the communication requirements as only local measurements are required for control. Comparative results with other alternatives on the IEEE 123-nodes and 342-nodes systems demonstrate the superiority of the proposed method.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available