SR-GRAT：面向敏捷固定翼无人机控制的自适应目标引导对称响应强化学习

SR-GRAT: Symmetric-Response Guided Reinforcement Learning With Adaptive Targeting for Agile Fixed-Wing UAV Control

IEEE Transactions on Cybernetics · 2026

被引 0

ABS 3

Lun Li
Weizhe Chen
Chenxu Qian
Hang Du
Xuebo Zhang

中文导读

提出一种对称响应引导的强化学习框架，通过双向自适应学习曲线和动态目标调度，提升固定翼无人机在轨迹跟踪、航点导航和动态追击中的收敛速度与鲁棒性。

Abstract

This article presents a novel curriculum framework named symmetric-response guided reinforcement learning (RL) for autopilot control of fixed-wing aircraft, driven by an adaptive bidirectional learning curve and a dynamic target scheduling mechanism. Unlike traditional methods with static or overly smoothed learning progressions, the proposed method dynamically adjusts the learning curve's slope in both directions based on historical reward trends, allowing the learning intensity to increase or decrease as needed. This bidirectional adjustment ensures that the agent is neither overwhelmed by excessively difficult tasks nor stagnated by too-easy ones, leading to better stability and faster convergence. Furthermore, dynamic target generation within an episode from static target constraints enables both reward amplification and implicit enforcement of maneuver rate constraints, improving learning efficiency without manual reward shaping. Experiments on trajectory tracking tasks show that the proposed controller achieves faster convergence, reduced overshoot, and more accurate tracking under turbulence. Further tests on waypoint navigation and dynamic pursuit demonstrate its superiority over the baseline, achieving more precise and timely interception. These results highlight the robustness and applicability of the controller to complex aerial missions such as autonomous air combat.

强化学习无人机控制自动驾驶仪轨迹跟踪自适应学习

阅读原文 ↗