基于固定时间收敛强化学习的多无人机系统安全包含控制

Secure Containment Control for Multi-UAV Systems by Fixed-Time Convergent Reinforcement Learning

IEEE Transactions on Cybernetics · 2025

被引 41 · 同刊同年前 1%

ABS 3

Feisheng Yang
Zhenyu Gong
Qinglai Wei
Yifei Lei

中文导读

研究了多无人机系统在遭受网络攻击时的安全包含控制问题，将攻防过程建模为零和博弈，用固定时间收敛的强化学习方法求解最优控制策略，并通过仿真和四旋翼实验验证了有效性。

Abstract

This article concerns the secure containment control problem for multiple autonomous aerial vehicles. The cyber attacker can manipulate control commands, resulting in containment failure in the position loop. Within a zero-sum graphical game framework, secure containment controllers and malicious attackers are regarded as game players, and the attack-defense process is recast as a min-max optimization problem. Acquiring optimal distributed secure control policies requires solving the game-related Hamilton-Jacobi-Isaacs (HJI) equations. Based on the critic-only neural network (NN) structure, the reinforcement learning (RL) method is employed in solving coupled HJI equations. The fixed-time convergence technique is introduced to improve the convergence rate of RL, and the experience replay mechanism is utilized to relax the persistence of excitation condition. The associated NN convergence and closed-loop stability are analyzed. In the attitude loop, the optimal feedback control law is obtained by solving Hamilton-Jacobi-Bellman equations using the fixed-time convergent RL method. The simulation example and the quadrotor experiment are given to show the effectiveness of the proposed scheme.

多无人机系统安全控制强化学习包含控制零和博弈

阅读原文 ↗