☆ 4.7 Article

Deep transfer Q-learning with virtual leader-follower for supply-demand Stackelberg game of smart grid

ENERGY (2017)

期刊

ENERGY

卷 133, 期 -, 页码 348-365

出版社

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.energy.2017.05.114

关键词

Deep transfer Q-learning; Virtual leader-follower; Supply-demand Stackelberg game; Smart grid

类别

Thermodynamics Energy & Fuels

资金

National Key Basic Research Program of China (973 Program) [2013CB228205]
National Natural Science Foundation of China [51477055]
Yunnan Provincial Talents Training Program [KKSY201604044]
Scientific Research Foundation of Yunnan Provincial Department of Education [KKJB201704007]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

This paper proposes a novel deep transfer Q-learning (DTQ) associated with a virtual leader-follower pattern for supply-demand Stackelberg game of smart grid. Each generator and load are regarded as an agent of a supplier and a demander, respectively, in which an economic dispatch (ED) and demand response (DR) can be simultaneously solved by DTQ To maximize the total payoff of all the agents, a virtual leader-follower pattern is employed to achieve a reliable collaboration among the agents. Then, Q-learning with a cooperative swarm is adopted for the knowledge learning for each agent via appropriate explorations and exploitations in an unknown environment. Furthermore, the original extremely large-scale knowledge matrix can be efficiently decomposed into several simplified small-scale knowledge matrices through a binary state-action chain, while the continuous actions can be generated for continuous variables. Lastly, a deep belief network (DBN) is used for knowledge transfer, thus DTQ can effectively exploit the prior knowledge from source tasks so as to rapidly obtain an optimal solution of a new task. Case studies are carried out to evaluate the performance of DTQ for supply-demand Stackelberg game of smart grid on a 94-agent system and a practical Shenzhen power grid of southern China. (C) 2017 Published by Elsevier Ltd.

Deep transfer Q-learning with virtual leader-follower for supply-demand Stackelberg game of smart grid

期刊

ENERGY

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Deep transfer Q-learning with virtual leader-follower for supply-demand Stackelberg game of smart grid

期刊

ENERGY

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文