Intelligent Critic Learning for Data-Driven Output Tracking Control With Lightweight Parallelization

IEEE Transactions on Systems, Man, and Cybernetics: Systems · 2025

Cited by 1

ABS 3

Jiangyu Wang
Ding Wang
Jin Ren
Junfei Qiao

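Summary

The paper proposes a data-driven parallel Q-learning algorithm. By designing a utility function tied directly to the system states, together with dual lightweight controllers, it addresses the instability, error amplification, and premature convergence of traditional discounted methods, eliminating tracking errors and accelerating convergence.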

Abstract

This article investigates critical challenges in optimal output tracking control, including residual tracking errors, potential instability caused by discount factors, and premature convergence due to inefficient termination criteria. We tackle these issues by developing a data-driven parallel Q-learning algorithm. Specifically, a utility function directly linked to system states is proposed to avoid the instability and error amplification issues in traditional discounted approaches. In addition, the algorithm employs dual lightweight controllers that exploit convergence properties to enhance learning efficiency. Based on these dual controllers, a novel termination criterion is introduced to prevent premature convergence during the training process. Numerical simulations demonstrate that the proposed method eliminates tracking errors, accelerates convergence compared with traditional algorithms, and ensures stable convergence across diverse system dynamics.

Optimal control · Reinforcement learning · Data-driven control · Output tracking
