基于状态势博弈的分布式Stackelberg策略在自主分散学习制造系统中的应用

Distributed Stackelberg Strategies in State-Based Potential Games for Autonomous Decentralized Learning Manufacturing Systems

IEEE Transactions on Systems, Man, and Cybernetics: Systems · 2025

被引 7

ABS 3

Steve Yuwono
Dorothea Schwung
Andreas Schwung

中文导读

提出DS2-SbPG框架，结合势博弈与Stackelberg博弈，在分散制造系统中实现多目标优化，无需设置组合目标函数，实验显示可降低10.61%能耗并提升系统性能。

Abstract

This article presents a novel game-theoretical (GT) framework, distributed Stackelberg strategies in state-based potential games (DS2-SbPGs), for autonomous multiobjective optimization in decentralized manufacturing systems. Existing approaches, including multiagent reinforcement learning (MARL) and native SbPG, struggle with scalability, coordination inefficiencies, and the complexity of tuning combined objective functions in real-world settings. DS2-SbPG integrates potential games and Stackelberg games, which improves the cooperative tradeoff capabilities of potential games and the multiobjective optimization handling by Stackelberg games. Notably, all training procedures are conducted in a fully distributed manner. DS2-SbPG offers a promising solution to finding optimal tradeoffs between objectives by eliminating the complexities of setting up combined objective optimization functions for individual players in self-learning domains, particularly in real-world industrial settings with diverse and numerous objectives between the subsystems. We formally prove that DS2-SbPG constitutes a dynamic potential game with guaranteed convergence. Experimental validation on a laboratory-scale testbed demonstrates the effectiveness of DS2-SbPG and its two variants: one with a single-leader–follower structure and another (Stack DS2-SbPG) for multileader–follower scenarios. Both variants significantly outperform native SbPG, achieving up to 10.61% reduction in power consumption while enhancing overall system performance, which signals the potential of DS2-SbPG in real-world applications.

博弈论制造系统多智能体强化学习分布式优化

阅读原文 ↗