多通道博弈中的收益控制：影响对手学习演化

Payoff Control in Multichannel Games: Influencing Opponent Learning Evolution

IEEE Transactions on Cybernetics · 2025

被引 8

ABS 3

Juan Shi
Chen Chu
Guoxi Fan
Die Hu
Jinzhuo Liu
Zhen Wang

中文导读

提出两种收益控制策略（部分控制和完全控制），使单个智能体在多通道重复博弈中限制对手的期望收益总和，并引导对手学习向期望均衡演化。

Abstract

In this article, we introduce a new theory for payoff control in multichannel learning environments, where agents interact with each other over multiple channels and each channel is a repeated normal form game. We propose two payoff control strategies-partial control and full control-that allow a single agent to set an upper bound to the opponent's expected payoffs summed across all channels, even if the opponent is a reinforcement learning agent. We prove that a partial (or full) control strategy can be obtained by solving a system of inequalities, and characterize the conditions under which such a partial (or full) control strategy exists. We show that by utilizing these control strategies, the agent can influence the opponent's learning evolution and direct it toward a desired viable equilibrium. Our experiments confirm the effectiveness of our theory for payoff control in a wide range of multichannel learning environments.

博弈论强化学习多智能体系统控制理论

阅读原文 ↗