基于在线数据驱动的最优输出跟踪控制：无需初始稳定策略

Online Data-Driven-Based Optimal Output Tracking Control Without Initial Stabilizing Policy

IEEE Transactions on Cybernetics · 2026

被引 0

ABS 3

Yongsheng Ma
Jian Sun
Yong Xu
Shisheng Cui
Li-Hua Dou

中文导读

针对未知模型的连续时间线性系统，提出一种双策略迭代学习算法，无需初始稳定策略或历史数据即可自适应学习最优跟踪控制器，仿真验证了其有效性。

Abstract

This article investigates the optimal output tracking control problem for a continuous-time linear system with an unknown system model. By integrating adaptive dynamic programming with optimal control theory, a dual policy iteration (PI) learning algorithm composed of two PI schemes is proposed to adaptively learn the optimal tracking controller. The primary advantage of the proposed algorithm lies in that it does not require an initial stabilizing control policy, persistence of excitation, or storage of historical data to guarantee convergence. This feature fundamentally distinguishes it from existing approaches based on the least-squares method, which rely on these conditions. Simulation results demonstrate the effectiveness of the proposed algorithm, and its superiority is further validated through comparisons with existing methods.

控制理论自适应动态规划最优控制数据驱动

阅读原文 ↗