Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming

Title
Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming
Authors
Keywords
-
Journal
MATHEMATICS OF OPERATIONS RESEARCH
Volume 37, Issue 1, Pages 66-94
Publisher
Institute for Operations Research and the Management Sciences (INFORMS)
Online
2012-01-14
DOI
10.1287/moor.1110.0532

Ask authors/readers for more resources

Find Funding. Review Successful Grants.

Explore over 25,000 new funding opportunities and over 6,000,000 successful grants.

Explore

Ask a Question. Answer a Question.

Quickly pose questions to the entire community. Debate answers and get clarity on the most important issues facing researchers.

Get Started