☆ 4.7 Article

Robust Adaptive Dynamic Programming of Two-Player Zero-Sum Games for Continuous-Time Linear Systems

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2015)

Journal

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS

Volume 26, Issue 12, Pages 3314-3319

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TNNLS.2015.2461452

Keywords

Game algebraic Riccati equation (GARE); policy iterations (PIs); robust adaptive dynamic programming (ADP); two-player zero-sum (ZS) games

Categories

Computer Science, Artificial Intelligence Computer Science, Hardware & Architecture Computer Science, Theory & Methods Engineering, Electrical & Electronic

Funding

Natural Science Foundation of China [61573090, 61473063]
Research Funds for the Central Universities [N130408003, N130108001]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

In this brief, an online robust adaptive dynamic programming algorithm is proposed for two-player zero-sum games of continuous-time unknown linear systems with matched uncertainties, which are functions of system outputs and states of a completely unknown exosystem. The online algorithm is developed using the policy iteration (PI) scheme with only one iteration loop. A new analytical method is proposed for convergence proof of the PI scheme. The sufficient conditions are given to guarantee globally asymptotic stability and suboptimal property of the closed-loop system. Simulation studies are conducted to illustrate the effectiveness of the proposed method.

Authors

I am an author on this paper

Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7

Not enough ratings

Secondary Ratings

Novelty

-

Significance

-

Scientific rigor

-

Rate this paper

Recommended

Article Automation & Control Systems

Non-zero-sum games of discrete-time Markov jump systems with unknown dynamics: An off-policy reinforcement learning method

Xuewen Zhang, Hao Shen, Feng Li, Jing Wang

Summary: This article focuses on the non-zero-sum games problem in discrete-time Markov jump systems. It proposes a model-based algorithm and an off-policy reinforcement learning algorithm to obtain optimal control policies without relying on system dynamics information.

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL (2023)

Add to Collection

Article Mathematics, Applied

Online reinforcement learning multiplayer non-zero sum games of continuous-time Markov jump linear systems

Xilin Xin, Yidong Tu, Vladimir Stojanovic, Hai Wang, Kaibo Shi, Shuping He, Tianhong Pan

Summary: This paper proposes a novel online mode-free integral reinforcement learning algorithm to solve multiplayer non-zero sum games. By collecting and learning subsystem information of states and inputs, and using online learning to compute corresponding N-coupled algebraic Riccati equations, the policy iterative algorithm presented in this paper can solve the coupled algebraic Riccati equations of multiplayer non-zero sum games. The effectiveness and feasibility of the design method is verified through a simulation example involving three players.

APPLIED MATHEMATICS AND COMPUTATION (2022)

Add to Collection

Article Automation & Control Systems

A Generalized Minimax Q-Learning Algorithm for Two-Player Zero-Sum Stochastic Games

Raghuram Bharadwaj Diddigi, Chandramouli Kamanchi, Shalabh Bhatnagar

Summary: This article investigates the problem of two-player zero-sum games and proposes a technique of successive relaxation to compute the min-max value faster. A generalized minimax Q-learning algorithm is also derived for finding the optimal policy when the model information is unknown.

IEEE TRANSACTIONS ON AUTOMATIC CONTROL (2022)

Add to Collection

Article Automation & Control Systems

A data-based private learning framework for enhanced security against replay attacks in cyber-physical systems

Lijing Zhai, Kyriakos G. Vamvoudakis

Summary: This article presents a data-based and private learning framework for detecting and mitigating replay attacks in cyber-physical systems. Optimal watermarking signals and a level of differential privacy have been added to improve the capability against replay attacks. By using data-based techniques, the best defending strategy has been learned, and a Neyman-Pearson detector has been proposed to identify replay attacks. Simulation results demonstrate the effectiveness of the approach and compare the data-based technique with a model-based one.

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL (2021)

Add to Collection

Article Engineering, Mechanical

Adaptive critic design for nonlinear multi-player zero-sum games with unknown dynamics and control constraints

Yu Huo, Ding Wang, Junfei Qiao, Menghua Li

Summary: This paper proposes a novel optimal control scheme based on the adaptive critic technology to solve the multi-player zero-sum game issue of continuous-time nonlinear systems with control constraints and unknown dynamics. A neural network-based identifier is used to reconstruct the unknown system dynamics, and a new nonquadratic function is developed to derive the associated Hamilton-Jacobi-Isaacs equation of the constrained game. An adaptive critic framework is then constructed to approximate the optimal cost function and estimate the optimal control strategy sets and worst disturbance. Theoretical analysis using Lyapunov stability theorem proves the uniform ultimate boundedness stability of the system state and the critic network weight approximation error. A representative example is simulated to validate the efficacy of the proposed framework.

NONLINEAR DYNAMICS (2023)

Add to Collection

Article Computer Science, Artificial Intelligence

A dynamical neural network approach for solving stochastic two-player zero-sum games

Dawen Wu, Abdel Lisser

Summary: This paper tackles a stochastic two-player zero-sum Nash game problem by modeling it as a dynamical neural network (DNN), showing that the DNN method has advantages in converging to better optimal points and solving large-scale problems.

NEURAL NETWORKS (2022)

Add to Collection

Article Management

Continuous Patrolling Games

Steve Alpern, Thuy Bui, Thomas Lidbetter, Katerina Papadaki

Summary: This study focuses on a patrolling game played on a network, aiming to model the problem of protecting roads or pipelines from adversarial attacks. The results provide solutions to the game for different network structures and attack durations.

OPERATIONS RESEARCH (2022)

Add to Collection

Article Computer Science, Artificial Intelligence

Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games

Yuanheng Zhu, Dongbin Zhao

Summary: This paper combines game theory, dynamic programming, and recent deep reinforcement learning techniques to online learn the Nash equilibrium policy for two-player zero-sum Markov games. By formulating the problem as a Bellman minimax equation and applying generalized policy iteration, the authors propose a learning algorithm that utilizes neural networks to approximate Q functions. The algorithm is proven to have convergence and is validated through experiments on different examples.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

Add to Collection

Article Automation & Control Systems

An addendum to the problem of zero-sum LQ stochastic mean-field dynamic games?

Samir Aberkane, Vasile Dragan

Summary: In this paper, we address a linear quadratic mean-field game problem with a leader-follower structure. We show how to obtain a state-feedback representation of the strategies achieving an open-loop Stackelberg equilibrium using a Riccati-type approach. We also establish the necessary and sufficient conditions for the solvability of the coupled generalized Riccati equations involved.

AUTOMATICA (2023)

Add to Collection

Article Computer Science, Information Systems

Event-triggered optimal control for discrete-time multi-player non-zero-sum games using parallel control

Jingwei Lu, Qinglai Wei, Ziyang Wang, Tianmin Zhou, Fei-Yue Wang

Summary: This paper introduces a novel event-triggered optimal control method for discrete-time multi-player non-zero-sum games. By combining event-triggered algorithm with parallel control, the system's asymptotic stability can be achieved and an upper bound for the sum of all players' actual performance indices can be determined in advance.

INFORMATION SCIENCES (2022)

Add to Collection

Review Computer Science, Information Systems

Modelling web-service uncertainty: The angel daemon approach

Joaquim Gabarro, Alan Stewart

Summary: The paper presents a survey of joint research work on uncertain systems with a focus on the behavior of large web applications under external attacks. Uncertain, multi-component systems can be modeled by orchestrations which call multiple web services and coordinate their responses. Uncertainty profiles are used to evaluate system behavior by providing a blurred snapshot of operating conditions.

COMPUTER SCIENCE REVIEW (2021)

Add to Collection

Article Automation & Control Systems

Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity

Kaiqing Zhang, Sham M. Kakade, Tamer Basar, Lin F. Yang

Summary: This paper investigates the sample complexity of model-based reinforcement learning in multi-agent settings. By studying discounted zero-sum Markov games with two players, the paper shows the sample complexity of model-based MARL in finding the Nash equilibrium value and ε-NE policies, and compares it with reward-aware algorithms.

JOURNAL OF MACHINE LEARNING RESEARCH (2023)

Add to Collection

Article Computer Science, Artificial Intelligence

Observer-based event-triggered control for zero-sum games of input constrained multi-player nonlinear systems

Shunchao Zhang, Bo Zhao, Derong Liu, Yongwei Zhang

Summary: This paper investigates an event-triggered control method based on adaptive dynamic programming to solve zero-sum game problems in unknown multi-player continuous-time nonlinear systems. By constructing a neural network observer to identify system dynamics and solving the ZSG problem using a critic NN, a triggering scheme is developed to update control and disturbance laws, ultimately proving the effectiveness of the proposed method.

NEURAL NETWORKS (2021)

Add to Collection

Article Automation & Control Systems

Improved saddle point prediction in stochastic two-player zero-sum games with a deep learning approach

Dawen Wu, Abdel Lisser

Summary: In this paper, we propose a novel deep learning approach that combines neurodynamic optimization and deep neural networks to predict saddle points in stochastic two-player zero-sum games. We model the game as an ODE system using neurodynamic optimization and develop a neural network to approximate the solution, including the prediction of the saddle point. A specialized algorithm is introduced to enhance the accuracy of the saddle point prediction. Experimental results demonstrate that our model outperforms existing approaches in terms of convergence speed and accuracy of saddle point predictions.

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2023)

Add to Collection

Article Engineering, Marine

Two-player zero-sum game based neural critic tracking control for UAUV with unknown disturbance via backstepping method

Gaofeng Che

Summary: This work proposes a new tracking control scheme for underactuated autonomous underwater vehicles (UAUVs) with unknown disturbance. By constructing an error tracking system and designing the online policy iteration algorithm, the proposed method achieves near-optimal control performance, improves the convergence speed of tracking error, and guarantees the stability of the system.

OCEAN ENGINEERING (2023)

Add to Collection

No Data Available

No Data Available

© Peeref 2019-2024. All rights reserved.