4.7 Article

A Scalable Privacy-Preserving Multi-Agent Deep Reinforcement Learning Approach for Large-Scale Peer-to-Peer Transactive Energy Trading

Journal

IEEE TRANSACTIONS ON SMART GRID
Volume 12, Issue 6, Pages 5185-5200

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TSG.2021.3103917

Keywords

HVAC; Peer-to-peer computing; Transactive energy; Scalability; Uncertainty; Reinforcement learning; Production; Distributed energy resources; energy management; local energy community; multi-agent deep reinforcement learning; peer-to-peer transactive energy trading

Funding

  1. National Natural Science Foundation of China [51877037]

Ask authors/readers for more resources

This paper proposes a novel P2P transactive trading scheme based on the MAAC algorithm, addressing challenges in large-scale coordinated management and P2P trading within an energy community, while incentivizing prosumers to engage in energy trading through a P2P trading platform, resulting in significant cost reduction and peak demand reduction for the community.
Peer-to-peer (P2P) transactive energy trading has emerged as a promising paradigm towards maximizing the flexibility value of prosumers' distributed energy resources (DERs). Despite reinforcement learning constitutes a well-suited model-free and data-driven methodological framework to optimize prosumers' energy management decisions, its application to the large-scale coordinated management and P2P trading among multiple prosumers within an energy community is still challenging, due to the scalability, non-stationarity and privacy limitations of state-of-the-art multi-agent deep reinforcement learning (MADRL) approaches. This paper proposes a novel P2P transactive trading scheme based on the multi-actor-attention-critic (MAAC) algorithm, which addresses the above challenges individually. This method is complemented by a P2P trading platform that incentivizes prosumers to engage in local energy trading while also penalizes each prosumer's addition to rebound peaks. Case studies involving a real-world, large-scale scenario with 300 residential prosumers demonstrate that the proposed method significantly outperforms the state-of-the-art MADRL methods in reducing the community's cost and peak demand.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available