Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming

出版年份 2012 全文链接

标题

作者

关键词

出版物

Volume 37, Issue 1, Pages 66-94

出版商

Institute for Operations Research and the Management Sciences (INFORMS)

发表日期

2012-01-14

DOI

10.1287/moor.1110.0532

Explore over 25,000 new funding opportunities and over 6,000,000 successful grants.

Explore

Quickly pose questions to the entire community. Debate answers and get clarity on the most important issues facing researchers.

Get Started