连续动作空间中多智能体深度强化学习模型的对抗攻击

Adversarial Attacks on Multiagent Deep Reinforcement Learning Models in Continuous Action Space

IEEE Transactions on Systems, Man, and Cybernetics: Systems · 2024

被引 6

ABS 3

Ziyuan Zhou
Guanjun Liu
Weiran Guo
MengChu Zhou

中文导读

提出一种新的对抗攻击框架，通过识别随时间变化的关键智能体并扰动其动作，有效攻击多智能体深度强化学习模型，实验证明比现有方法扰动能力更强。

Abstract

Multiagent deep reinforcement learning (MADRL) has been recently applied in many fields, including industry 5.0, but it is sensitive to adversarial attacks. Although adversarial attacks can be detrimental, they are crucial for testing and assisting in enhancing the robustness of models. Existing attacks on MADRL-based models are not sufficient since these attacks involve fixed perturbed agents, without taking into account cases where perturbed agents change. In this article, we present a novel adversarial attack framework. In this framework, we define critical agents that change over time, i.e., when they are perturbed a little, the whole multiagent system is perturbed greatly. Then, we identify critical agents through their worst-case joint actions. In this identifying process, we use gradient information, differential evolution, and SARSA to deal with the challenge caused by changes in the perturbed agents and to compute the worst-case joint actions. After identifying them, we use the target attack method to perturb them. We apply our method to attack the models trained by two state-of-the-art MADRL algorithms under three environments, including two industry-related ones. The experimental results demonstrate our method has a stronger perturbing ability than the existing methods.

多智能体系统深度强化学习对抗攻击鲁棒性

阅读原文 ↗