Policy iteration based Q-learning for linear nonzero-sum quadratic differential games
Published 2019 View Full Article
- Home
- Publications
- Publication Search
- Publication Details
Title
Policy iteration based Q-learning for linear nonzero-sum quadratic differential games
Authors
Keywords
adaptive dynamic programming, ADP, Q-learning, reinforcement learning, RL, linear nonzero-sum quadratic differential games, policy iteration, PI, off-policy
Journal
Science China-Information Sciences
Volume 62, Issue 5, Pages -
Publisher
Springer Nature
Online
2019-04-08
DOI
10.1007/s11432-018-9602-1
References
Ask authors/readers for more resources
Related references
Note: Only part of the references are listed.- Output feedback Q-learning for discrete-time linear zero-sum games with application to the H-infinity control
- (2018) Syed Ali Asad Rizvi et al. AUTOMATICA
- Cooperative Q-Learning for Rejection of Persistent Adversarial Inputs in Networked Linear Quadratic Systems
- (2018) Kyriakos G. Vamvoudakis et al. IEEE TRANSACTIONS ON AUTOMATIC CONTROL
- Off-Policy Q-Learning: Set-Point Design for Optimizing Dual-Rate Rougher Flotation Operational Processes
- (2018) Jinna Li et al. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS
- Developing nonlinear adaptive optimal regulators through an improved neural learning mechanism
- (2017) Ding Wang et al. Science China-Information Sciences
- Q-learning for continuous-time linear systems: A model-free infinite horizon optimal control approach
- (2017) Kyriakos G. Vamvoudakis SYSTEMS & CONTROL LETTERS
- Off-Policy Integral Reinforcement Learning Method to Solve Nonlinear Continuous-Time Multiplayer Nonzero-Sum Games
- (2017) Ruizhuo Song et al. IEEE Transactions on Neural Networks and Learning Systems
- Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data
- (2017) Yuanheng Zhu et al. IEEE Transactions on Neural Networks and Learning Systems
- Discrete-Time Nonzero-Sum Games for Multiplayer Using Policy-Iteration-Based Adaptive Dynamic Programming Algorithms
- (2017) Huaguang Zhang et al. IEEE Transactions on Cybernetics
- Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control
- (2017) Biao Luo et al. IEEE Transactions on Cybernetics
- Construction of Barrier in a Fishing Game With Point Capture
- (2017) Wenzhong Zha et al. IEEE Transactions on Cybernetics
- Error Bound Analysis of $Q$ -Function for Discounted Optimal Control Problems With Policy Iteration
- (2017) Pengfei Yan et al. IEEE Transactions on Systems Man Cybernetics-Systems
- Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics
- (2016) Dongbin Zhao et al. IEEE Transactions on Cybernetics
- Cross-Modal Retrieval With CNN Visual Features: A New Baseline
- (2016) Yunchao Wei et al. IEEE Transactions on Cybernetics
- Non-zero sum Nash Q-learning for unknown deterministic continuous-time linear systems
- (2015) Kyriakos G. Vamvoudakis AUTOMATICA
- Constructive $\epsilon$-Nash Equilibria for Nonzero-Sum Differential Games
- (2015) Thulasi Mylvaganam et al. IEEE TRANSACTIONS ON AUTOMATIC CONTROL
- A novel policy iteration based deterministic Q-learning for discrete-time nonlinear systems
- (2015) QingLai Wei et al. Science China-Information Sciences
- Approximate $N$ -Player Nonzero-Sum Game Solution for an Uncertain Continuous Nonlinear System
- (2015) Marcus Johnson et al. IEEE Transactions on Neural Networks and Learning Systems
- $ {H}_{ {\infty }}$ Tracking Control of Completely Unknown Continuous-Time Systems via Off-Policy Reinforcement Learning
- (2015) Hamidreza Modares et al. IEEE Transactions on Neural Networks and Learning Systems
- Continuous-Time Q-Learning for Infinite-Horizon Discounted Cost Linear Quadratic Regulator Problems
- (2015) Muthukumar Palanisamy et al. IEEE Transactions on Cybernetics
- Off-Policy Reinforcement Learning for $ H_\infty $ Control Design
- (2015) Biao Luo et al. IEEE Transactions on Cybernetics
- Data-based approximate policy iteration for affine nonlinear continuous-time optimal control design
- (2014) Biao Luo et al. AUTOMATICA
- Online Synchronous Approximate Optimal Learning Algorithm for Multi-Player Non-Zero-Sum Games With Unknown Dynamics
- (2014) Derong Liu et al. IEEE Transactions on Systems Man Cybernetics-Systems
- Multi-agent differential graphical games: Online adaptive learning solution for synchronization with optimality
- (2012) Kyriakos G. Vamvoudakis et al. AUTOMATICA
- Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics
- (2012) Yu Jiang et al. AUTOMATICA
- Differential Games Controllers That Confine a System to a Safe Region in the State Space, With Applications to Surge Tank Control
- (2012) Paola Falugi et al. IEEE TRANSACTIONS ON AUTOMATIC CONTROL
- Near-Optimal Control for Nonzero-Sum Differential Games of Continuous-Time Nonlinear Systems Using Single-Network ADP
- (2012) Huaguang Zhang et al. IEEE Transactions on Cybernetics
- Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton–Jacobi equations
- (2011) Kyriakos G. Vamvoudakis et al. AUTOMATICA
- Hybrid MDP based integrated hierarchical Q-learning
- (2011) ChunLin Chen et al. Science China-Information Sciences
- An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
- (2010) Huaguang Zhang et al. AUTOMATICA
- Adaptive optimal control for continuous-time linear systems based on policy iteration
- (2008) D. Vrabie et al. AUTOMATICA
- Neurodynamic Programming and Zero-Sum Games for Constrained Control Systems
- (2008) M. Abu-Khalaf et al. IEEE TRANSACTIONS ON NEURAL NETWORKS
Discover Peeref hubs
Discuss science. Find collaborators. Network.
Join a conversationCreate your own webinar
Interested in hosting your own webinar? Check the schedule and propose your idea to the Peeref Content Team.
Create Now