基于最小-最大分布式模型预测控制的非线性多智能体系统安全强化学习

Safe Reinforcement Learning for Nonlinear Multiagent Systems Based on Min–Max DMPC

IEEE Transactions on Cybernetics · 2026

被引 0

ABS 3

Yang Peng
Huaicheng Yan
Qiwei Liu
Haining Yan
Yuanyi Zheng
Youmin Zhang

中文导读

提出一种安全强化学习框架，结合最小-最大分布式模型预测控制，用于非线性多智能体系统，在保证安全的同时在线更新控制器参数以降低保守性，并通过仿真验证了效果。

Abstract

This article presents a safe reinforcement learning (RL) framework for nonlinear multiagent systems (MASs) based on min-max distributed model predictive control (DMPC). The proposed method employs min-max DMPC as a robust baseline to generate control strategies, optimizing closed-loop performance in the presence of disturbances while ensuring interpretability and safety. To mitigate the conservatism inherent in traditional robust DMPC due to its reliance on precise models and fixed disturbance bounds, safe RL is introduced to adaptively update the controller parameters and disturbance sets online. The proposed parameter update mechanism of safe RL formally guarantees the recursive feasibility of the DMPC algorithm during the learning process. Furthermore, theoretical analyses of closed-loop stability are provided. The effectiveness and scalability of the proposed method are validated through two simulation examples.

强化学习非线性系统多智能体系统分布式模型预测控制

阅读原文 ↗