Article
Computer Science, Artificial Intelligence
Xiong Yang, Yuanheng Zhu, Na Dong, Qinglai Wei
Summary: The study focuses on the decentralized event-driven control problem of nonlinear dynamical systems, transforming it into a group of nonlinear optimal control problems and solving it using event-driven Hamilton-Jacobi-Bellman equations to ensure overall system stability. The critic neural network architecture is utilized to address the problem, with Lyapunov method ensuring signal stability in closed-loop subsystems.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
(2022)
Article
Automation & Control Systems
Derong Liu, Shan Xue, Bo Zhao, Biao Luo, Qinglai Wei
Summary: This article reviews the recent development of adaptive dynamic programming (ADP) with applications in control, highlighting efficient algorithms and future research directions. ADP is applied in optimization, game theory, and large-scale systems, showing great potential in the era of artificial intelligence.
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS
(2021)
Article
Automation & Control Systems
Antonio Sala, Leopoldo Armesto
Summary: This study introduces a new criterion for adaptive meshing in polyhedral partitions to interpolate value functions, employing an initial condition probability density function, uncertainty propagation, and temporal-difference error to determine the addition of new points. A collection of lemmas justifies the algorithmic proposal, with comparative analysis highlighting the advantages of this proposal over other options in literature. The developed methods are applied in simulation examples and an experimental robotic setup.
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE
(2022)
Article
Automation & Control Systems
Xiong Yang, Qinglai Wei
Summary: This article introduces an optimal event-driven control scheme for a continuous stirred tank reactor (CSTR) system, utilizing discounted cost and adaptive critic designs to solve the optimal event-driven control problem. Stability analysis of signals in the closed-loop system is conducted using the Lyapunov approach.
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS
(2021)
Article
Automation & Control Systems
Mingming Ha, Ding Wang, Derong Liu
Summary: In this paper, a new approach is proposed to address the tracking control problem. By introducing a new cost function and a novel stability analysis method, the issue of incomplete elimination of tracking error in traditional approaches is solved. The specific implementation scheme for the special case of linear systems is also provided.
IEEE-CAA JOURNAL OF AUTOMATICA SINICA
(2022)
Article
Computer Science, Artificial Intelligence
Xiong Yang, Qinglai Wei
Summary: This article investigates the optimal event-triggered control problem of nonlinear continuous-time systems with asymmetric control constraints. A discounted cost is introduced to obtain the optimal solution, and an event-triggered Hamilton-Jacobi-Bellman equation and triggering condition are proposed to solve the problem.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
(2021)
Article
Automation & Control Systems
Yongwei Zhang, Bo Zhao, Derong Liu, Shunchao Zhang
Summary: This article investigates the event-triggered optimal tracking control problem for multiplayer unknown nonlinear systems using adaptive critic designs. It proposes a novel weight updating rule and derives the optimal tracking control policy for each player by solving coupled event-triggered Hamilton-Jacobi equations. Additionally, an event-triggering condition is designed to ensure the uniform ultimate boundedness of the closed-loop multiplayer systems.
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL
(2022)
Article
Automation & Control Systems
Qiliang Luo, Shan Xue, Derong Liu
Summary: In this article, a novel decentralised event-triggered control scheme is developed to stabilise interconnected systems. By restructuring interconnected systems in a distributed way and using adaptive critic designs with event-triggering mechanism, the robust control problem is transformed into an optimal control problem. The Lyapunov stability proof ensures the asymptotic stability of the whole system and the critic neural networks' ultimate boundedness.
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE
(2022)
Article
Multidisciplinary Sciences
Yanping Gao, Zuojun Liu
Summary: This paper proposes a novel adaptive fixed-time disturbance observer (AFXDO) based approximate optimal tracking control architecture for nonlinear systems with partially unknown dynamic drift and perturbation. The AFXDO is designed to attenuate the impact of disturbance without prior information, ensuring that the observer errors converge to zero in a fixed time. The approximate optimal control is achieved by incorporating the real-time estimation of AFXDO into a critic-only ADP framework, resulting in fast transient performance and low control consumptions.
Article
Computer Science, Artificial Intelligence
Ziyu Lin, Jingliang Duan, Shengbo Eben Li, Haitong Ma, Jie Li, Jianyu Chen, Bo Cheng, Jun Ma
Summary: The research addresses the challenge of solving the finite-horizon HJB equation, proposes a new algorithm, and validates its effectiveness through simulations.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
(2023)
Article
Computer Science, Artificial Intelligence
Yanwei Zhao, Ben Niu, Guangdeng Zong, Ning Xu, Adil M. Ahmad
Summary: This paper proposes an event-triggered adaptive dynamic programming (ETADP) algorithm to address the optimal decentralized control problem of interconnected nonlinear systems with stochastic dynamics. The algorithm converts the control problem into deriving optimal control policies in an aperiodic pattern. It utilizes an identifier-actor-critic network framework with updating laws constructed through the negative gradient method of a positive function. The stability of the closed-loop system is verified using Lyapunov theory, and a numerical simulation is provided to demonstrate the effectiveness of the proposed control scheme. (c) 2023 Elsevier B.V. All rights reserved.
Article
Automation & Control Systems
Mingming Ha, Ding Wang, Derong Liu
Summary: Inspired by the successive relaxation method, a novel discounted iterative adaptive dynamic programming framework is developed, which possesses an adjustable convergence rate for the iterative value function sequence. The convergence properties of the value function sequence and the stability of the closed-loop systems under the new discounted value iteration (VI) are investigated. An accelerated learning algorithm with convergence guarantee is presented based on the properties of the given VI scheme. Amidst the implementation of the new VI scheme and its accelerated learning design, value function approximation and policy improvement are involved. The performance of the developed approaches is verified using a nonlinear fourth-order ball-and-beam balancing plant, showing significant acceleration of the convergence rate of the value function and reduction in computational cost compared to traditional VI.
IEEE TRANSACTIONS ON CYBERNETICS
(2023)
Article
Computer Science, Artificial Intelligence
Longjie Zhang, Yong Chen
Summary: This article investigates the finite-time optimal control (FTOC) for affine-form nonlinear systems inspired by state optimization and finite-time convergence. The authors propose a novel finite-time adaptive dynamic programming (FTADP) approach to achieve optimal stability with finite response time. They also implement a novel adaptive dynamic programming (ADP) based on the finite-time critic-actor offline neural network (NN) approximation algorithm to solve the optimal controller with nonlinearity characteristic. The application analysis on circuit systems demonstrates the superiority of the proposed FTADP compared to general optimal control.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
(2023)
Article
Automation & Control Systems
Chuanhao Hu, Yuanyuan Zou, Shaoyuan Li
Summary: This paper presents an ADP-based decentralized event-triggered control strategy for large-scale nonlinear systems, utilizing event-triggered scheme and local neural networks observer to reduce communication cost and computational burden, with a decentralized triggering condition designed to ensure overall stabilization of the large-scale systems.
ASIAN JOURNAL OF CONTROL
(2022)
Article
Automation & Control Systems
Cong Li, Yongchao Wang, Fangzhou Liu, Qingchen Liu, Martin Buss
Summary: This article presents a new formulation for model-free robust optimal regulation of continuous-time nonlinear systems using an incremental adaptive dynamic programming (IADP) approach. It leverages time delay estimation (TDE) technique and measured input-state data to achieve incremental stabilization under uncertainties, disturbances, and saturation. Numerical simulations validate its effectiveness and superiority in reducing energy expenditure and enhancing robustness.
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL
(2022)
Article
Computer Science, Artificial Intelligence
Mingming Ha, Ding Wang, Derong Liu
Summary: In this article, a novel value iteration scheme is proposed, which introduces a relaxation factor and combines with other methods to accelerate and guarantee the convergence. The theoretical results and numerical examples demonstrate its fast convergence speed and stability.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
(2023)
Article
Computer Science, Artificial Intelligence
Mingduo Lin, Bo Zhao, Derong Liu
Summary: A novel policy gradient (PG) adaptive dynamic programming method is proposed for nonlinear discrete-time zero-sum games with unknown dynamics. A policy iteration algorithm is used to approximate the Q-function and the control and disturbance policies using neural network approximators. The control and disturbance policies are then updated using the PG method based on the iterative Q-function. The experience replay technique is applied to improve training stability and data usage efficiency. Simulation results show the effectiveness of the proposed method.
Article
Computer Science, Artificial Intelligence
Mingming Liang, Derong Liu
Summary: This article presents a novel neural-network-based optimal event-triggered impulsive control method. The proposed method utilizes a general-event-based impulsive transition matrix (GITM) to represent the evolving characteristics of all system states across impulsive actions. Through the developed event-triggered impulsive adaptive dynamic programming (ETIADP) algorithm and its high-efficiency version (HEIADP), the optimization problems for stochastic systems with event-triggered impulsive controls are addressed. The results show that the proposed methods can reduce computational and communication burdens and fulfill the desired goals.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
(2023)
Article
Automation & Control Systems
Mingming Ha, Ding Wang, Derong Liu
Summary: Inspired by the successive relaxation method, a novel discounted iterative adaptive dynamic programming framework is developed, which possesses an adjustable convergence rate for the iterative value function sequence. The convergence properties of the value function sequence and the stability of the closed-loop systems under the new discounted value iteration (VI) are investigated. An accelerated learning algorithm with convergence guarantee is presented based on the properties of the given VI scheme. Amidst the implementation of the new VI scheme and its accelerated learning design, value function approximation and policy improvement are involved. The performance of the developed approaches is verified using a nonlinear fourth-order ball-and-beam balancing plant, showing significant acceleration of the convergence rate of the value function and reduction in computational cost compared to traditional VI.
IEEE TRANSACTIONS ON CYBERNETICS
(2023)
Article
Automation & Control Systems
Runqi Chai, Derong Liu, Tianhao Liu, Antonios Tsourdos, Yuanqing Xia, Senchun Chai
Summary: This paper presents an integrated real-time trajectory planning and tracking control framework for autonomous ground vehicles (AGV) parking maneuver problems, utilizing deep neural networks and recurrent network structures. Two transfer learning strategies are applied to adapt the motion planner for different AGV types. Experimental studies demonstrate enhanced performance in fulfilling parking missions.
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING
(2023)
Article
Computer Science, Artificial Intelligence
Danyu Lin, Shan Xue, Derong Liu, Mingming Liang, Yonghua Wang
Summary: In this paper, a problem of multiplayer hierarchical decision-making for non-affine systems is solved using adaptive dynamic programming. The control dynamics are obtained and combined with the original system dynamics, transforming the non-affine multiplayer system into a general affine form. The hierarchical decision problem is modeled as a Stackelberg game, and a neural network is used to reconstruct the augmented system and approximate the value function. The feasibility and effectiveness of the algorithm are confirmed through simulation.
Article
Computer Science, Artificial Intelligence
Chujian Zeng, Bo Zhao, Derong Liu
Summary: This paper proposes a neuro-dynamic programming-based fault tolerant control scheme for a class of nonlinear systems, considering the occurrence of both actuator and sensor faults simultaneously. The scheme combines a descriptor observer with an adaptive observer to estimate system states and multiple faults. By employing a critic neural network, the approximate optimal control policy is obtained for the fault-free system. An FTC law is developed to suppress the influence of actuator faults by combining the estimations of actuator faults with the approximate optimal control policy. The stability of the closed-loop nonlinear system is analyzed using the Lyapunov stability theorem.
Proceedings Paper
Automation & Control Systems
Jinquan Lin, Bo Zhao, Derong Liu
Summary: In this paper, an integral reinforcement learning (IRL)-based approximate optimal control (AOC) method is developed for unknown nonaffine systems using dynamic feedback. The optimal control policy for nonaffine systems cannot be explicitly expressed due to the unknown input gain matrix. Thus, a dynamic feedback signal is introduced to transform the nonaffine system into an augmented affine system. The AOC for unknown nonaffine systems is formulated by designing an appropriate value function for the augmented affine system, and the IRL method is adopted to derive the approximate solution of the Hamilton-Jacobi-Bellman equation.
2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS
(2023)
Article
Automation & Control Systems
Bo Zhao, Guang Shi, Derong Liu
Summary: This article investigates local control problems for nonlinear interconnected systems by using adaptive dynamic programming (ADP) with particle swarm optimization (PSO). It constructs a proper local value function and employs a local critic neural network to solve the local Hamilton-Jacobi-Bellman equation. The event-triggering mechanism is introduced to determine the sampling time instants and ensure asymptotic stability through Lyapunov stability analysis.
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS
(2023)
Article
Automation & Control Systems
Hui Wang, Zhigang Liu, Zhiwei Han, Yanbo Wu, Derong Liu
Summary: Active pantograph control is a promising technique for improving train's current collection quality. Existing solutions have limitations in handling various operating conditions and lack of adaptability. In this study, a context-based deep meta-reinforcement learning algorithm is proposed to alleviate these problems. Experimental results show that the proposed algorithm can quickly adapt to new conditions and reduce contact force fluctuations.
IEEE TRANSACTIONS ON CYBERNETICS
(2023)
Article
Computer Science, Artificial Intelligence
Zhanyu Yang, Bo Zhao, Derong Liu
Summary: In this article, a novel pinning control method that requires only partial node information is developed to synchronize drive-response memristor-based neural networks with time delay. An improved mathematical model of the networks is established to accurately describe their dynamic behaviors. Unlike previous literature that requires information from all nodes, the proposed method only relies on local information to achieve synchronization of delayed networks, reducing communication and calculation burdens. Sufficient conditions for synchronization are provided, and numerical simulation and comparative experiments validate the effectiveness and superiority of the proposed method.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
(2023)
Article
Automation & Control Systems
Mingduo Lin, Bo Zhao, Derong Liu
Summary: In this article, an event-triggered robust adaptive dynamic programming (ETRADP) algorithm is proposed to solve multiplayer Stackelberg-Nash games (MSNGs) for uncertain nonlinear continuous-time systems. The hierarchical decision-making process considering different roles of players is described, transforming the robust control problem into an optimal regulation problem. An online policy iteration algorithm is used to solve the derived Hamilton-Jacobi equation with an event-triggered mechanism to reduce computational and communication burdens. Critic neural networks (NNs) are constructed to obtain the event-triggered approximate optimal control policies for all players, ensuring the stability of the closed-loop system.
IEEE TRANSACTIONS ON CYBERNETICS
(2023)
Article
Automation & Control Systems
Mingming Liang, Derong Liu
Summary: This article focuses on designing the optimal impulsive controller (IMC) of continuous-time nonlinear systems and proposes a new adaptive dynamic programming algorithm with high generality and feasibility. The introduced policy-improving mechanism makes the algorithm more flexible for memory-limited computing devices.
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS
(2023)
Article
Automation & Control Systems
Ke Wang, Chaoxu Mu, Zhen Ni, Derong Liu
Summary: This paper presents a novel composite obstacle avoidance control method that generates safe motion trajectories for autonomous systems in an adaptive manner. The method combines model-based policy iteration and state-following-based approximation in a safe reinforcement learning framework. The proposed learning-based controller achieves stable reaching of target points while maintaining a safe distance from obstacles. The effectiveness of the method is demonstrated through simulations and comparisons with other avoidance control methods.
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING
(2023)
Article
Computer Science, Artificial Intelligence
Bo Zhao, Yongwei Zhang, Derong Liu
Summary: This article presents a cooperative motion/force control scheme for modular reconfigurable manipulators (MRMs) based on adaptive dynamic programming (ADP). The dynamic model of the entire MRM system is treated as a set of joint modules interconnected by coupling torque, and the Jacobian matrix is mapped into each joint. A neural network is used as a robust decentralized observer, and an improved local value function is constructed for each joint module. The control scheme is achieved by using force feedback compensation and is proven to be uniformly ultimately bounded through Lyapunov stability analysis.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
(2023)