Reinforcement Learning H ∞ Optimal Formation Control for Perturbed Multiagent Systems With Nonlinear Faults
针对存在未知非线性故障和外部扰动的多智能体系统,提出一种基于强化学习的最优编队控制策略,通过神经网络和故障观测器实现预设性能下的鲁棒控制。
This article presents an optimal formation control strategy for multiagent systems based on a reinforcement learning (RL) technique, considering prescribed performance and unknown nonlinear faults. To optimize the control performance, an RL strategy is introduced based on the identifier-critic–actor-disturbance structure and backstepping frame. The identifier, critic, actor, and disturbance neural networks (NNs) are employed to estimate unknown dynamics, assess system performance, carry out control actions, and derive the worst disturbance strategy, respectively. With the scheme, the persistent excitation requirements are removed by adopting simplified NNs updating laws, which are derived using the gradient descent method toward designed positive functions instead of the square of Bellman residual. For achieving the desired error precision within the prescribed time, a constraining function and an error transformation scheme are employed. In addition, to enhance the system’s robustness, a fault observer is utilized to compensate for the impact of the unknown nonlinear faults. The stability of the closed-loop system is assured, while the prescribed performance is realized. Finally, simulation examples validate the effectiveness of the proposed optimal control strategy.