Policy iteration based Q-learning for linear nonzero-sum quadratic differential games

Authors
Keywords
adaptive dynamic programming, ADP, Q-learning, reinforcement learning, RL, linear nonzero-sum quadratic differential games, policy iteration, PI, off-policy
Journal
Science China-Information Sciences
Volume 62, Issue 5
Publisher
Springer Nature
Online
2019-04-08
DOI
10.1007/s11432-018-9602-1
