4.7 Article

Neural Network Based Online Simultaneous Policy Update Algorithm for Solving the HJI Equation in Nonlinear H∞ Control

Journal

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNNLS.2012.2217349

Keywords

H-infinity state feedback control; Hamilton-Jacobi-Isaacs equation; neural network; online; simultaneous policy update algorithm

Funding

  1. National Basic Research Program of China through the 973 Program [2012CB720003]
  2. National Natural Science Foundation of China [61074057, 61121003, 91016004]

It is well known that the nonlinear H-infinity state feedback control problem relies on the solution of the Hamilton-Jacobi-Isaacs (HJI) equation, a nonlinear partial differential equation that is generally impossible to solve analytically. In this paper, a neural network (NN)-based online simultaneous policy update algorithm (SPUA) is developed to solve the HJI equation, in which knowledge of the internal system dynamics is not required. First, we propose an online SPUA, which can be viewed as a reinforcement learning technique for two players to learn their optimal actions in an unknown environment. The proposed online SPUA updates the control and disturbance policies simultaneously; thus, only one iterative loop is needed. Second, the convergence of the online SPUA is established by proving that it is mathematically equivalent to Newton's method for finding a fixed point in a Banach space. Third, we develop an actor-critic structure for the implementation of the online SPUA, in which only one critic NN is needed to approximate the cost function, and a least-squares method is given for estimating the NN weight parameters. Finally, simulation studies are provided to demonstrate the effectiveness of the proposed algorithm.
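
To make the "one iterative loop" and the Newton's-method equivalence concrete, the following is a minimal sketch for the linear-quadratic special case only, where the HJI equation reduces to a game algebraic Riccati equation. It is model-based (it uses the matrix A), so it is not the paper's online NN algorithm, which avoids internal dynamics. The function names `solve_lyapunov`, `spua_linear`, and `gare_residual`, the zero initial guess, and the example matrices are all assumptions made for illustration.

```python
import numpy as np


def solve_lyapunov(Ac, M):
    """Solve Ac.T @ P + P @ Ac + M = 0 for P via Kronecker vectorization."""
    n = Ac.shape[0]
    I = np.eye(n)
    # Row-major vec: vec(Ac.T P) = kron(Ac.T, I) vec(P), vec(P Ac) = kron(I, Ac.T) vec(P)
    lhs = np.kron(Ac.T, I) + np.kron(I, Ac.T)
    P = np.linalg.solve(lhs, -M.reshape(-1)).reshape(n, n)
    return 0.5 * (P + P.T)  # symmetrize against round-off


def spua_linear(A, B, D, Q, R, gamma, iters=20):
    """Simultaneous policy update for dx/dt = A x + B u + D w (illustrative).

    Both gains are refreshed from the same P in every pass, so a single
    loop suffices. Starts from P = 0, which assumes A is Hurwitz and
    gamma is large enough for the iteration to converge.
    """
    n = A.shape[0]
    P = np.zeros((n, n))
    for _ in range(iters):
        K = np.linalg.solve(R, B.T @ P)   # control policy u = -K x
        L = (D.T @ P) / gamma**2          # disturbance policy w = L x
        Ac = A - B @ K + D @ L            # closed loop under both policies
        M = Q + K.T @ R @ K - gamma**2 * (L.T @ L)
        P = solve_lyapunov(Ac, M)         # policy evaluation (one Newton step)
    return P


def gare_residual(A, B, D, Q, R, gamma, P):
    """Residual of A'P + PA + Q - P B R^-1 B' P + gamma^-2 P D D' P."""
    S = B @ np.linalg.solve(R, B.T) - (D @ D.T) / gamma**2
    return A.T @ P + P @ A + Q - P @ S @ P


# Hypothetical 2-state example (values chosen only for illustration).
A = np.array([[-1.0, 1.0], [0.0, -2.0]])
B = np.array([[0.0], [1.0]])
D = np.array([[1.0], [0.0]])
Q = np.eye(2)
R = np.array([[1.0]])
gamma = 5.0

P = spua_linear(A, B, D, Q, R, gamma)
print("P =", P)
print("residual norm =", np.linalg.norm(gare_residual(A, B, D, Q, R, gamma, P)))
```

Each pass solves one Lyapunov equation, which is exactly a Newton step on the Riccati residual in this linear setting; a residual norm near zero indicates the iteration has reached a solution.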

Recommended

Article Automation & Control Systems

Bipartite output consensus in networked multi-agent systems of high-order power integrators with signed digraph and input noises

Hongwen Ma, Derong Liu, Ding Wang, Biao Luo

INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE (2016)

Article Computer Science, Artificial Intelligence

Reinforcement learning solution for HJB equation arising in constrained optimal control problem

Biao Luo, Huai-Ning Wu, Tingwen Huang, Derong Liu

NEURAL NETWORKS (2015)

Article Computer Science, Information Systems

Data-based robust adaptive control for a class of unknown nonlinear constrained-input systems via integral reinforcement learning

Xiong Yang, Derong Liu, Biao Luo, Chao Li

INFORMATION SCIENCES (2016)

Article Computer Science, Artificial Intelligence

Model-Free Optimal Tracking Control via Critic-Only Q-Learning

Biao Luo, Derong Liu, Tingwen Huang, Ding Wang

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2016)

Article Automation & Control Systems

An Approximate Optimal Control Approach for Robust Stabilization of a Class of Discrete-Time Nonlinear Systems With Uncertainties

Ding Wang, Derong Liu, Hongliang Li, Biao Luo, Hongwen Ma

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2016)

Article Automation & Control Systems

Output Tracking Control Based on Adaptive Dynamic Programming With Multistep Policy Evaluation

Biao Luo, Derong Liu, Tingwen Huang, Jiangjiang Liu

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2019)

Article Automation & Control Systems

Adaptive Synchronization of Delayed Memristive Neural Networks With Unknown Parameters

Zhanyu Yang, Biao Luo, Derong Liu, Yueheng Li

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2020)

Article Automation & Control Systems

Optimal Output Regulation for Model-Free Quanser Helicopter With Multistep Q-Learning

Biao Luo, Huai-Ning Wu, Tingwen Huang

IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS (2018)

Article Computer Science, Information Systems

Reinforcement learning for robust adaptive control of partially unknown nonlinear systems subject to unmatched uncertainties

Xiong Yang, Haibo He, Qinglai Wei, Biao Luo

INFORMATION SCIENCES (2018)

Article Automation & Control Systems

Adaptive Q-Learning for Data-Based Optimal Output Regulation With Experience Replay

Biao Luo, Yin Yang, Derong Liu

IEEE TRANSACTIONS ON CYBERNETICS (2018)

Article Computer Science, Artificial Intelligence

Adaptive dynamic programming based event-triggered control for unknown continuous-time nonlinear systems with input constraints

Shan Xue, Biao Luo, Derong Liu, Yueheng Li

NEUROCOMPUTING (2020)

Article Computer Science, Artificial Intelligence

Adaptive synchronization of memristor-based neural networks with discontinuous activations

Yueheng Li, Biao Luo, Derong Liu, Zhanyu Yang, Yunli Zhu

NEUROCOMPUTING (2020)

Article Computer Science, Artificial Intelligence

Event-Triggered Optimal Control With Performance Guarantees Using Adaptive Dynamic Programming

Biao Luo, Yin Yang, Derong Liu, Huai-Ning Wu

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2020)

Article Computer Science, Artificial Intelligence

Multi-scale local LSSVM based spatiotemporal modeling and optimal control for the goethite process

Jiayang Dai, Ning Chen, Biao Luo, Weihua Gui, Chunhua Yang

NEUROCOMPUTING (2020)

Proceedings Paper Computer Science, Artificial Intelligence

Pinning Control for Synchronization of Drive-Response Memristive Neural Networks with Nonidentical Parameters

Yueheng Li, Biao Luo, Derong Liu, Zhe Dong, Zhanyu Yang

2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) (2019)
