☆ 4.6 Article

Optimal control for discrete-time affine non-linear systems using general value iteration

IET CONTROL THEORY AND APPLICATIONS (2012)

期刊

IET CONTROL THEORY AND APPLICATIONS

卷 6, 期 18, 页码 2725-2736

出版社

INST ENGINEERING TECHNOLOGY-IET

DOI: 10.1049/iet-cta.2011.0783

关键词

-

类别

Automation & Control Systems Engineering, Electrical & Electronic Instruments & Instrumentation

资金

National Natural Science Foundation of China [60904037, 60921061, 61034002]
Beijing Natural Science Foundation [4102061]
China Postdoctoral Science Foundation [201104162]

向作者/读者索取更多资源

Protocol

Reagent

摘要

In this study, the authors propose a novel adaptive dynamic programming scheme based on general value iteration (VI) to obtain near optimal control for discrete-time affine non-linear systems with continuous state and control spaces. First, the selection of initial value function is different from the traditional VI, and a new method is introduced to demonstrate the convergence property and convergence speed of value function. Then, the control law obtained at each iteration can stabilise the system under some conditions. At last, an error-bound-based condition is derived considering the approximation errors of neural networks, and then the error between the optimal and approximated value functions can also be estimated. To facilitate the implementation of the iterative scheme, three neural networks with Levenberg-Marquardt training algorithm are used to approximate the unknown system, the value function and the control law. Two simulation examples are presented to demonstrate the effectiveness of the proposed scheme.

作者

我是这篇论文的作者

点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6

评分不足

次要评分

新颖性

-

重要性

-

科学严谨性

-

评价这篇论文

推荐

Article Automation & Control Systems

Adaptive Optimal Control of Linear Periodic Systems: An Off-Policy Value Iteration Approach

Bo Pang, Zhong-Ping Jiang

Summary: This article studies the infinite-horizon adaptive optimal control of continuous-time linear periodic systems and proposes a novel value iteration-based off-policy adaptive dynamic programming algorithm for a general class of systems. The algorithm is proven to uniformly converge to optimal solutions in both model-based and model-free cases, without assuming knowledge of an initial stabilizing controller. Application to a triple inverted pendulum demonstrates the feasibility and effectiveness of the proposed method.

IEEE TRANSACTIONS ON AUTOMATIC CONTROL (2021)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Reinforcement Learning and Adaptive Optimal Control for Continuous-Time Nonlinear Systems: A Value Iteration Approach

Tao Bian, Zhong-Ping Jiang

Summary: This article studies the adaptive optimal control problem for continuous-time nonlinear systems described by differential equations and proposes a new continuous-time value iteration method to address the limitations of existing methods. Adaptive optimal controllers for systems with unknown dynamics are obtained through this method, along with a learning-based control algorithm.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

添加到收藏夹

Article Automation & Control Systems

The linear quadratic optimal control problem for discrete-time Markov jump linear singular systems

Jorge R. Chavez-Fuentes, Eduardo F. Costa, Marco H. Terra, Kaio D. T. Rocha

Summary: This paper addresses the linear quadratic optimal control problem for discrete-time Markov jump linear singular systems, obtaining results under conditions that bring additional structure to the considered systems. The approach involves base transformations and control action restrictions to ensure regularity of the closed-loop system. The results are evaluated using an example.

AUTOMATICA (2021)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Optimal Control for Constrained Discrete-Time Nonlinear Systems Based on Safe Reinforcement Learning

Lingzhi Zhang, Lei Xie, Yi Jiang, Zhishan Li, Xueqin Liu, Hongye Su

Summary: This article proposes a constrained optimal control approach for discrete-time nonlinear systems based on safe reinforcement learning. By introducing a barrier function, the constrained optimization problem is transformed into an unconstrained one, and a constrained policy iteration algorithm is developed to ensure optimal control and constraint satisfaction.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

添加到收藏夹

Article Acoustics

Optimal control design for linear time-varying systems by interpolated variational iteration method

Mohammad Shirazian

Summary: This paper proposes an improved approximate solution method for optimal control of linear time-varying systems. The optimality conditions are derived and the well-known variational iteration method, interpolated by B-spline functions, is applied to solve these conditions. The method is accelerated through redundant calculation elimination and does not require solving system equations or optimization problems. The convergence of the proposed method is proved and its efficiency is illustrated through several examples.

JOURNAL OF VIBRATION AND CONTROL (2023)

添加到收藏夹

Article Computer Science, Artificial Intelligence

System Stability of Learning-Based Linear Optimal Control With General Discounted Value Iteration

Ding Wang, Jin Ren, Mingming Ha, Junfei Qiao

Summary: This article discusses the impact of the discount factor on the stabilization of control strategies. It presents methods to judge the stability of the controlled system and select appropriate discount factors. The practical rule for selecting discount factors is constructed based on the undiscounted optimal control problem.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

添加到收藏夹

Article Automation & Control Systems

Data-driven optimal tracking control of discrete-time linear systems with multiple delays via the value iteration algorithm

Longyan Hao, Chaoli Wang, Guang Zhang, Chonglin Jing, Yibo Shi

Summary: This paper studies the optimal tracking problem for discrete-time linear systems with multiple delays without system dynamics. A new data-driven value iteration algorithm is proposed, considering past control inputs, system outputs, and external reference trajectories. The algorithm transforms the original system according to the characteristics of the time-delay system, derives a novel data-driven state equation, and solves the optimal control of multi-delay systems. Results demonstrate the convergence of the algorithm and the asymptotic stability of the tracking error. Simulations show the effectiveness of the controller.

INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE (2022)

添加到收藏夹

Article Automation & Control Systems

Modified general policy iteration based adaptive dynamic programming for unknown discrete-time linear systems

Huaiyuan Jiang, Bin Zhou, Guang-Ren Duan

Summary: This article studies the general policy iteration (GPI) method for optimal control of discrete-time linear systems. The existing result on the GPI method is recalled and some new properties are proposed. A model-based modified GPI algorithm is proposed based on these new properties, with its convergence proof provided. In addition, a data-driven implementation for the proposed method is introduced, which does not require the use of system matrices. The proposed algorithm further relaxes the condition to initiate the GPI based algorithm compared to existing results. The effectiveness of the proposed modified GPI based algorithm is verified through a simulation example.

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL (2022)

添加到收藏夹

Article Automation & Control Systems

Policy Iteration for Optimal Control of Discrete-Time Time-Varying Nonlinear Systems

Guangyu Zhu, Xiaolu Li, Ranran Sun, Yiyuan Yang, Peng Zhang

Summary: In this paper, a new iterative adaptive dynamic programming algorithm called discrete-time time-varying policy iteration (DTTV) algorithm is developed for infinite horizon optimal control problems of discrete time-varying nonlinear systems. The algorithm updates the iterative value function to approximate the index function of optimal performance. The admissibility and convergence properties of the iterative control law are analyzed.

IEEE-CAA JOURNAL OF AUTOMATICA SINICA (2023)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Reinforcement Learning-Based Model Predictive Control for Discrete-Time Systems

Min Lin, Zhongqi Sun, Yuanqing Xia, Jinhui Zhang

Summary: This article proposes a novel reinforcement learning-based model predictive control (RLMPC) scheme that integrates model predictive control (MPC) and reinforcement learning (RL) through policy iteration (PI). The scheme improves the generated policy by using the obtained value function as the terminal cost of MPC, eliminating the need for the offline design paradigm of traditional MPC. RLMPC enables a more flexible choice of prediction horizon and shows superiority over traditional MPC for nonlinear systems.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

添加到收藏夹

Article Mathematics, Interdisciplinary Applications

Optimal Control of Unknown Discrete-Time Linear Systems with Additive Noise

Xue Yang, Shujun Liu

Summary: This paper investigates the optimal control problem with a long run average cost for unknown linear discrete-time systems with additive noise. The authors propose a value iteration-based stochastic adaptive dynamic programming (VI-based SADP) algorithm to obtain the optimal controller. Unlike existing work, this algorithm does not require estimation of the expectation and variance of states or other relevant variables, and its convergence can be rigorously proven. A simulation example is provided to verify the effectiveness of the proposed approach.

JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY (2023)

添加到收藏夹

Article Automation & Control Systems

Optimal decision strategy for discrete-time Markovian jump linear systems

Jin Zhu, Qingkun Zhang

Summary: This paper investigates the discrete-time Markovian jump linear systems (MJLSs) whose mode transition probability matrix (MTPM) can be adjusted by decisions. A decision strategy is proposed for stabilisation and optimisation of such MJLSs, considering the decision cost. The paper gives the feasible domain of decision for stable and unstable MJLSs with initial MTPM, introduces a generalised performance index, and presents a value iteration algorithm for optimisation.

INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE (2023)

添加到收藏夹

Article Automation & Control Systems

Stochastic optimal control problems of discrete-time Markov jump systems

Teng Song

Summary: In this paper, the indefinite stochastic optimal control problems of discrete-time Markov jump linear systems are considered. A new stochastic maximum principle is established, and the necessary and sufficient solvability condition of the indefinite control problem with non-discounted cost is derived. The optimal control is designed using coupled generalized Riccati difference equations with Markov jump and linear recursive equations with Markov jump. An example of a defined-benefit pension fund with regime switching is provided to illustrate the validity of the obtained results.

OPTIMAL CONTROL APPLICATIONS & METHODS (2023)

添加到收藏夹

Article Automation & Control Systems

Optimal Learning Control Scheme for Discrete-Time Systems With Nonuniform Trials

Chen Liu, Xiaoe Ruan, Dong Shen, Hao Jiang

Summary: This article investigates an intermittent optimal learning control scheme that considers partially available information to address the issue of varying operational lengths in rehabilitation training. The proposed scheme achieves optimal learning gain by adopting the latest captured historical timewise input and tracking error.

IEEE TRANSACTIONS ON CYBERNETICS (2023)

添加到收藏夹

Correction Automation & Control Systems

Global Practical Stabilization of Discrete-time Switched Affine Systems via a General Quadratic Lyapunov Function and a Decentralized Ellipsoid (vol 8, pg 1837, 2021)

Mohammad Hejri

Summary: This passage points out the need for corrections in the statements of Lemmas 2 and 4, as well as in the proofs of Lemma 2 and Prop. 3.

IEEE-CAA JOURNAL OF AUTOMATICA SINICA (2022)

添加到收藏夹

Article Automation & Control Systems

Event-triggered robust control for multi-player nonzero-sum games with input constraints and mismatched uncertainties

Shunchao Zhang, Bo Zhao, Derong Liu, Cesare Alippi, Yongwei Zhang

Summary: In this article, an event-triggered robust control (ETRC) method is investigated for multi-player nonzero-sum games of continuous-time input constrained nonlinear systems with mismatched uncertainties. The method transforms the robust control problem into an optimal regulation problem by constructing an auxiliary system and designing an appropriate value function. A critic neural network (NN) is used to approximate the value function of each player and obtain control laws. The method reduces computational burden and communication bandwidth by updating the control laws when events occur. The effectiveness of the developed ETRC method is demonstrated through theoretical analysis and examples.

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL (2023)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Event-triggered adaptive dynamic programming for decentralized tracking control of input constrained unknown nonlinear interconnected systems

Qiuye Wu, Bo Zhao, Derong Liu, Marios M. Polycarpou

Summary: This paper proposes an event-triggered adaptive dynamic programming method to solve the decentralized tracking control problem for input constrained unknown nonlinear interconnected systems. A neural-network-based local observer is established to reconstruct the system dynamics using local input-output data and desired trajectories. The DTC problem is transformed into an optimal control problem using a nonquadratic value function. The DTC policy is obtained by solving the local Hamilton-Jacobi-Bellman equation through the observer-critic architecture, with weights tuned by the experience replay technique. Simulation examples demonstrate the effectiveness of the proposed scheme.

NEURAL NETWORKS (2023)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Policy gradient adaptive dynamic programming for nonlinear discrete-time zero-sum games with unknown dynamics

Mingduo Lin, Bo Zhao, Derong Liu

Summary: A novel policy gradient (PG) adaptive dynamic programming method is proposed for nonlinear discrete-time zero-sum games with unknown dynamics. A policy iteration algorithm is used to approximate the Q-function and the control and disturbance policies using neural network approximators. The control and disturbance policies are then updated using the PG method based on the iterative Q-function. The experience replay technique is applied to improve training stability and data usage efficiency. Simulation results show the effectiveness of the proposed method.

SOFT COMPUTING (2023)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Neuro-Optimal Event-Triggered Impulsive Control for Stochastic Systems via ADP

Mingming Liang, Derong Liu

Summary: This article presents a novel neural-network-based optimal event-triggered impulsive control method. The proposed method utilizes a general-event-based impulsive transition matrix (GITM) to represent the evolving characteristics of all system states across impulsive actions. Through the developed event-triggered impulsive adaptive dynamic programming (ETIADP) algorithm and its high-efficiency version (HEIADP), the optimization problems for stochastic systems with event-triggered impulsive controls are addressed. The results show that the proposed methods can reduce computational and communication burdens and fulfill the desired goals.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

添加到收藏夹

Article Automation & Control Systems

Novel Discounted Adaptive Critic Control Designs With Accelerated Learning Formulation

Mingming Ha, Ding Wang, Derong Liu

Summary: Inspired by the successive relaxation method, a novel discounted iterative adaptive dynamic programming framework is developed, which possesses an adjustable convergence rate for the iterative value function sequence. The convergence properties of the value function sequence and the stability of the closed-loop systems under the new discounted value iteration (VI) are investigated. An accelerated learning algorithm with convergence guarantee is presented based on the properties of the given VI scheme. Amidst the implementation of the new VI scheme and its accelerated learning design, value function approximation and policy improvement are involved. The performance of the developed approaches is verified using a nonlinear fourth-order ball-and-beam balancing plant, showing significant acceleration of the convergence rate of the value function and reduction in computational cost compared to traditional VI.

IEEE TRANSACTIONS ON CYBERNETICS (2023)

添加到收藏夹

Article Automation & Control Systems

Adaptive Dynamic Programming-Based Event-Triggered Robust Control for Multiplayer Nonzero-Sum Games With Unknown Dynamics

Yongwei Zhang, Bo Zhao, Derong Liu, Shunchao Zhang

Summary: In this article, the event-triggered robust control problem of unknown multiplayer nonlinear systems with constrained inputs and uncertainties is investigated using adaptive dynamic programming. A neural network-based identifier is constructed to relax the requirement of system dynamics. By designing a nonquadratic value function, the stabilization problem is converted into a constrained optimal control problem. The approximate solution of the event-triggered Hamilton-Jacobi equation is obtained using a critic network with a novel weight updating law, and the Lyapunov stability theorem ensures that the multiplayer system is uniformly ultimately bounded.

IEEE TRANSACTIONS ON CYBERNETICS (2023)

添加到收藏夹

Article Automation & Control Systems

An Efficient Impulsive Adaptive Dynamic Programming Algorithm for Stochastic Systems

Mingming Liang, Yonghua Wang, Derong Liu

Summary: In this study, a novel general impulsive transition matrix is defined to reveal the transition dynamics and probability distribution evolution patterns between impulsive events. Based on this matrix, policy iteration-based impulsive adaptive dynamic programming algorithms are developed to solve optimal impulsive control problems. The algorithms demonstrate convergence to the optimal impulsive performance index function and allow for optimization on computing devices with low memory spaces.

IEEE TRANSACTIONS ON CYBERNETICS (2023)

添加到收藏夹

Proceedings Paper Automation & Control Systems

Data-Based Approximate Optimal Control for Unknown Nonaffine Systems via Dynamic Feedback

Jinquan Lin, Bo Zhao, Derong Liu

Summary: In this paper, an integral reinforcement learning (IRL)-based approximate optimal control (AOC) method is developed for unknown nonaffine systems using dynamic feedback. The optimal control policy for nonaffine systems cannot be explicitly expressed due to the unknown input gain matrix. Thus, a dynamic feedback signal is introduced to transform the nonaffine system into an augmented affine system. The AOC for unknown nonaffine systems is formulated by designing an appropriate value function for the augmented affine system, and the IRL method is adopted to derive the approximate solution of the Hamilton-Jacobi-Bellman equation.

2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS (2023)

添加到收藏夹

Article Automation & Control Systems

Event-Triggered Local Control for Nonlinear Interconnected Systems Through Particle Swarm Optimization-Based Adaptive Dynamic Programming

Bo Zhao, Guang Shi, Derong Liu

Summary: This article investigates local control problems for nonlinear interconnected systems by using adaptive dynamic programming (ADP) with particle swarm optimization (PSO). It constructs a proper local value function and employs a local critic neural network to solve the local Hamilton-Jacobi-Bellman equation. The event-triggering mechanism is introduced to determine the sampling time instants and ensure asymptotic stability through Lyapunov stability analysis.

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2023)

添加到收藏夹

Article Automation & Control Systems

Rapid Adaptation for Active Pantograph Control in High-Speed Railway via Deep Meta Reinforcement Learning

Hui Wang, Zhigang Liu, Zhiwei Han, Yanbo Wu, Derong Liu

Summary: Active pantograph control is a promising technique for improving train's current collection quality. Existing solutions have limitations in handling various operating conditions and lack of adaptability. In this study, a context-based deep meta-reinforcement learning algorithm is proposed to alleviate these problems. Experimental results show that the proposed algorithm can quickly adapt to new conditions and reduce contact force fluctuations.

IEEE TRANSACTIONS ON CYBERNETICS (2023)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Synchronization of Delayed Memristor-Based Neural Networks via Pinning Control With Local Information

Zhanyu Yang, Bo Zhao, Derong Liu

Summary: In this article, a novel pinning control method that requires only partial node information is developed to synchronize drive-response memristor-based neural networks with time delay. An improved mathematical model of the networks is established to accurately describe their dynamic behaviors. Unlike previous literature that requires information from all nodes, the proposed method only relies on local information to achieve synchronization of delayed networks, reducing communication and calculation burdens. Sufficient conditions for synchronization are provided, and numerical simulation and comparative experiments validate the effectiveness and superiority of the proposed method.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

添加到收藏夹

Article Automation & Control Systems

Event-Triggered Robust Adaptive Dynamic Programming for Multiplayer Stackelberg-Nash Games of Uncertain Nonlinear Systems

Mingduo Lin, Bo Zhao, Derong Liu

Summary: In this article, an event-triggered robust adaptive dynamic programming (ETRADP) algorithm is proposed to solve multiplayer Stackelberg-Nash games (MSNGs) for uncertain nonlinear continuous-time systems. The hierarchical decision-making process considering different roles of players is described, transforming the robust control problem into an optimal regulation problem. An online policy iteration algorithm is used to solve the derived Hamilton-Jacobi equation with an event-triggered mechanism to reduce computational and communication burdens. Critic neural networks (NNs) are constructed to obtain the event-triggered approximate optimal control policies for all players, ensuring the stability of the closed-loop system.

IEEE TRANSACTIONS ON CYBERNETICS (2023)

添加到收藏夹

Article Automation & Control Systems

Liquid-Updating Impulsive Adaptive Dynamic Programming for Continuous Nonlinear Systems

Mingming Liang, Derong Liu

Summary: This article focuses on designing the optimal impulsive controller (IMC) of continuous-time nonlinear systems and proposes a new adaptive dynamic programming algorithm with high generality and feasibility. The introduced policy-improving mechanism makes the algorithm more flexible for memory-limited computing devices.

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2023)

添加到收藏夹

Article Automation & Control Systems

Safe Reinforcement Learning and Adaptive Optimal Control With Applications to Obstacle Avoidance Problem

Ke Wang, Chaoxu Mu, Zhen Ni, Derong Liu

Summary: This paper presents a novel composite obstacle avoidance control method that generates safe motion trajectories for autonomous systems in an adaptive manner. The method combines model-based policy iteration and state-following-based approximation in a safe reinforcement learning framework. The proposed learning-based controller achieves stable reaching of target points while maintaining a safe distance from obstacles. The effectiveness of the method is demonstrated through simulations and comparisons with other avoidance control methods.

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING (2023)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Adaptive Dynamic Programming-Based Cooperative Motion/Force Control for Modular Reconfigurable Manipulators: A Joint Task Assignment Approach

Bo Zhao, Yongwei Zhang, Derong Liu

Summary: This article presents a cooperative motion/force control scheme for modular reconfigurable manipulators (MRMs) based on adaptive dynamic programming (ADP). The dynamic model of the entire MRM system is treated as a set of joint modules interconnected by coupling torque, and the Jacobian matrix is mapped into each joint. A neural network is used as a robust decentralized observer, and an improved local value function is constructed for each joint module. The control scheme is achieved by using force feedback compensation and is proven to be uniformly ultimately bounded through Lyapunov stability analysis.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

添加到收藏夹

暂无数据

© Peeref 2019-2024. All rights reserved.