期刊
ENERGY
卷 133, 期 -, 页码 348-365出版社
PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.energy.2017.05.114
关键词
Deep transfer Q-learning; Virtual leader-follower; Supply-demand Stackelberg game; Smart grid
资金
- National Key Basic Research Program of China (973 Program) [2013CB228205]
- National Natural Science Foundation of China [51477055]
- Yunnan Provincial Talents Training Program [KKSY201604044]
- Scientific Research Foundation of Yunnan Provincial Department of Education [KKJB201704007]
This paper proposes a novel deep transfer Q-learning (DTQ) associated with a virtual leader-follower pattern for supply-demand Stackelberg game of smart grid. Each generator and load are regarded as an agent of a supplier and a demander, respectively, in which an economic dispatch (ED) and demand response (DR) can be simultaneously solved by DTQ To maximize the total payoff of all the agents, a virtual leader-follower pattern is employed to achieve a reliable collaboration among the agents. Then, Q-learning with a cooperative swarm is adopted for the knowledge learning for each agent via appropriate explorations and exploitations in an unknown environment. Furthermore, the original extremely large-scale knowledge matrix can be efficiently decomposed into several simplified small-scale knowledge matrices through a binary state-action chain, while the continuous actions can be generated for continuous variables. Lastly, a deep belief network (DBN) is used for knowledge transfer, thus DTQ can effectively exploit the prior knowledge from source tasks so as to rapidly obtain an optimal solution of a new task. Case studies are carried out to evaluate the performance of DTQ for supply-demand Stackelberg game of smart grid on a 94-agent system and a practical Shenzhen power grid of southern China. (C) 2017 Published by Elsevier Ltd.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据