4.7 Article

Deep transfer Q-learning with virtual leader-follower for supply-demand Stackelberg game of smart grid

期刊

ENERGY
卷 133, 期 -, 页码 348-365

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.energy.2017.05.114

关键词

Deep transfer Q-learning; Virtual leader-follower; Supply-demand Stackelberg game; Smart grid

资金

  1. National Key Basic Research Program of China (973 Program) [2013CB228205]
  2. National Natural Science Foundation of China [51477055]
  3. Yunnan Provincial Talents Training Program [KKSY201604044]
  4. Scientific Research Foundation of Yunnan Provincial Department of Education [KKJB201704007]

向作者/读者索取更多资源

This paper proposes a novel deep transfer Q-learning (DTQ) associated with a virtual leader-follower pattern for supply-demand Stackelberg game of smart grid. Each generator and load are regarded as an agent of a supplier and a demander, respectively, in which an economic dispatch (ED) and demand response (DR) can be simultaneously solved by DTQ To maximize the total payoff of all the agents, a virtual leader-follower pattern is employed to achieve a reliable collaboration among the agents. Then, Q-learning with a cooperative swarm is adopted for the knowledge learning for each agent via appropriate explorations and exploitations in an unknown environment. Furthermore, the original extremely large-scale knowledge matrix can be efficiently decomposed into several simplified small-scale knowledge matrices through a binary state-action chain, while the continuous actions can be generated for continuous variables. Lastly, a deep belief network (DBN) is used for knowledge transfer, thus DTQ can effectively exploit the prior knowledge from source tasks so as to rapidly obtain an optimal solution of a new task. Case studies are carried out to evaluate the performance of DTQ for supply-demand Stackelberg game of smart grid on a 94-agent system and a practical Shenzhen power grid of southern China. (C) 2017 Published by Elsevier Ltd.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据