4.8 Article

Multi-agent deep deterministic policy gradient algorithm for peer-to-peer energy trading considering distribution network constraints

Journal

APPLIED ENERGY
Volume 317, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.apenergy.2022.119123

Keywords

Multi-agent; Deep deterministic policy gradient; Peer-to-peer energy trading; Renewable generation; Markov decision process

Funding

  1. Innovate UK Urban-X Project [106187]
  2. State Key Laboratory of Alternate Electrical Power System with Renewable Energy Sources, North China Electric Power University [LAPS20012]
  3. ERDF
  4. BEIS via the Smart Energy Network Demonstrator project [32R16P00706]

Ask authors/readers for more resources

This paper investigates the energy cost minimization problem for prosumers participating in peer-to-peer energy trading. A multi-agent deep deterministic policy gradient algorithm is proposed to learn optimal energy trading decisions, while distribution network tariffs are introduced to satisfy the distribution network constraints.
In this paper, we investigate an energy cost minimization problem for prosumers participating in peer -to-peer energy trading. Due to (i) uncertainties caused by renewable energy generation and consumption, (ii) difficulties in developing an accurate and efficient energy trading model, and (iii) the need to satisfy distribution network constraints, it is challenging for prosumers to obtain optimal energy trading decisions that minimize their individual energy costs. To address the challenge, we first formulate the above problem as a Markov decision process and propose a multi-agent deep deterministic policy gradient algorithm to learn optimal energy trading decisions. To satisfy the distribution network constraints, we propose distribution network tariffs which we incorporate in the algorithm as incentives to incentivize energy trading decisions that help to satisfy the constraints and penalize the decisions that violate them. The proposed algorithm is model -free and allows the agents to learn the optimal energy trading decisions without having prior information about other agents in the network. Simulation results based on real-world datasets show the effectiveness and robustness of the proposed algorithm.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available