期刊
OPTIMAL CONTROL APPLICATIONS & METHODS
卷 38, 期 3, 页码 317-335出版社
WILEY
DOI: 10.1002/oca.2259
关键词
adaptive dynamic programming; optimal control; discrete-time; nonlinear system; neural network; online learning; Lyapunov method
资金
- National Natural Science Foundation of China [61233001, 61273140, 61304086, 61374105]
- Beijing Natural Science Foundation [4132078]
- Early Career Development Award of SKLMCCS
In this paper, a novel identifier-actor-critic optimal control scheme is developed for discrete-time affine nonlinear systems with uncertainties. In contrast to traditional adaptive dynamic programming methodology, which requires at least partial knowledge of the system dynamics, a neural-network identifier is employed to learn the unknown control coefficient matrix working together with actor-critic-based scheme to solve the optimal control online. The critic network learns the approximate value function at each step. The actor network attempts to improve the current policy based on the approximate value function. The weights of all neural networks are updated at each sampling instant. Lyapunov theory is utilized to prove the stability of closed-loop system. It shows that system states and neural network weights are uniformly ultimately bounded. Finally, simulations are provided to illustrate the effectiveness of the developed method. Copyright (C) 2016 John Wiley & Sons, Ltd.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据