面向轻量级终身深度强化学习的统一稀疏训练框架

A Unified Sparse Training Framework for Lightweight Lifelong Deep Reinforcement Learning

IEEE Transactions on Systems, Man, and Cybernetics: Systems · 2025

被引 1

ABS 3

Meng Xu
Xinhong Chen
Weiwei Fu
Y.S. Lin
Yung‐Hui Li
Jianping Wang

中文导读

提出一个通用稀疏训练框架，可集成到现有终身深度强化学习方法中，在实现90%稀疏性的同时提升回报率，最高提升34%。

Abstract

Lifelong deep reinforcement learning (DRL) methods enable continuous adaptation to new tasks and retention of old knowledge. However, these methods often necessitate large model sizes, leading to substantial computational and storage resource requirements during training and inference. Unfortunately, existing research has not yet provided a lightweight solution to address this issue. This work aims to develop a generic method that can be seamlessly integrated into existing lifelong DRL methods to facilitate their achievement of lightweight models while also yielding higher returns. While sparse training (ST) methods have been extensively used in the DRL community to achieve lightweight models, they exacerbate the issue of catastrophic forgetting and compromise generalization when applied in lifelong DRL. To improve generalization, we develop a gradient optimization method that leverages sharpness-aware minimization (SAM) to smooth the gradient surface of the model without introducing excessive computational complexity. In addition, to alleviate catastrophic forgetting and promote model convergence, we introduce a priority-based approach that samples effective past experiences from the replay buffer. Extensive experiments demonstrate that our approach achieves 90% sparsity in five representative lifelong DRL methods while achieving higher episode return and average return (up to 34% improvement) across all episodes compared to the dense models.

强化学习终身学习稀疏训练深度强化学习

阅读原文 ↗