面向未知动力学非线性时滞系统的滚动时域演员-评论家学习控制

Receding Horizon Actor–Critic Learning Control for Nonlinear Time-Delay Systems With Unknown Dynamics

IEEE Transactions on Systems, Man, and Cybernetics: Systems · 2023

被引 17

ABS 3

Jiahang Liu
Xinglong Zhang
Xin Xu
Quan Xiong

中文导读

提出一种滚动时域演员-评论家学习方法，用于未知动力学的非线性时滞系统近最优控制，通过数据驱动预测和松弛终端条件降低计算成本，仿真表明性能优于非线性模型预测控制。

Abstract

With the development of modern mechatronics and networked systems, the controller design of time-delay systems has received notable attention. Time delays can greatly influence the stability and performance of the systems, especially for optimal control design. In this article, we propose a receding horizon actor–critic learning control approach for near-optimal control of nonlinear time-delay systems (RACL-TD) with unknown dynamics. In the proposed approach, a data-driven predictor for nonlinear time-delay systems is first learned based on the Koopman theory using precollected samples. Then, a receding horizon actor–critic architecture is designed to learn a near-optimal control policy. In RACL-TD, the terminal cost is determined by using the Lyapunov–Krasovskii approach so that the influences of the delayed states and control inputs can be well addressed. Furthermore, a relaxed terminal condition is present to reduce the computational cost. The convergence and optimality of RACL-TD in each prediction interval as well as the closed-loop property of the system are discussed and analyzed. Simulation results on a two-stage time-delayed chemical reactor illustrate that RACL-TD can achieve better control performance than nonlinear model predictive control (MPC) and infinite-horizon adaptive dynamic programming. Moreover, RACL-TD can have less computational cost than nonlinear MPC.

控制理论非线性系统时滞系统最优控制自适应动态规划

阅读原文 ↗