期刊
NEUROCOMPUTING
卷 251, 期 -, 页码 127-135出版社
ELSEVIER SCIENCE BV
DOI: 10.1016/j.neucom.2017.04.008
关键词
Optimal tracking control; Continuous nonlinear system; Adaptive critic design (ACD); Neural network
资金
- National Natural Science Foundation of China [61433004, 61627809, 61621004]
- IAPI Fundamental Research Funds [2013ZCX14]
In this paper, the optimal tracking control problem (OTCP) for a class of continuous-time nonlinear systems with infinite horizon cost is discussed. An online adaptive critic design method is proposed to learn the solution of OTCP by constructing an augmented system associated with a discounted performance function, which is composed of the tracking errors and reference trajectory dynamics. Only one neural network (NN) is used as critic module for approximating the performance function in the solution procedure, and thus the architecture is simpler than the typical action-critic structure, which needs more computational load from neural networks. Therefore, by the means of the approximate policy iteration, the tracking errors get converged to a region near zero and the parameters of critic module get converged to the optimal ones based on our proposed method. Both the convergence of the NN weights and the stability of the tracking error dynamics are guaranteed by the Lyapunov theory. Two simulation examples are proposed to verify the effectiveness of the proposed method. (C) 2017 Elsevier B.V. All rights reserved.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据