Policy iteration based Q-learning for linear nonzero-sum quadratic differential games

Published 2019 View Full Article

Authors

Keywords

adaptive dynamic programming, ADP, Q-learning, reinforcement learning, RL, linear nonzero-sum quadratic differential games, policy iteration, PI, off-policy

Journal

Science China-Information Sciences

Volume 62, Issue 5, Pages -

Publisher

Springer Nature

Online

2019-04-08

DOI

10.1007/s11432-018-9602-1

References

View 31 related references

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Reprint

Contact the author

Discover Peeref hubs

Discuss science. Find collaborators. Network.

Join a conversation

Create your own webinar

Interested in hosting your own webinar? Check the schedule and propose your idea to the Peeref Content Team.

Create Now