基于强化学习的欠驱动水面艇有限时间最优包含控制及性能保证

Reinforcement Learning-Based Finite-Time Optimal Containment Control for Underactuated Surface Vehicles With Guaranteed Performance

IEEE Transactions on Systems, Man, and Cybernetics: Systems · 2024

被引 12

ABS 3

Lin Chen
Chao Dong
Shi‐Lu Dai

中文导读

提出一种在线强化学习算法，解决欠驱动水面艇在模型不确定和未知海扰下的分布式最优包含控制问题，使包含误差在有限时间内收敛并满足预设性能。

Abstract

This article presents an online reinforcement learning (RL) algorithm to learn the distributed optimal containment control solution for underactuated surface vehicles subject to modeling uncertainties and unknown ocean disturbances. First, the request of exact knowledge of vehicle system dynamics is avoided by constructing an adaptive neural network identifier. The unknown lumped disturbances are estimated by disturbance observers. Second, an actor-critic-based RL algorithm is developed to release the persistence of excitation condition. Third, based on RL algorithm and backstepping procedure, we design adaptive finite-time optimal containment controllers such that the containment errors converge into a small neighborhood around zero in finite time and satisfy the predefined performance specifications. Finally, simulation studies on underactuated surface vehicles are provided to verify the effectiveness of the presented optimal control algorithm.

强化学习欠驱动系统最优控制水面艇包含控制

阅读原文 ↗