prunAdag：一种自适应剪枝感知梯度方法

prunAdag: an adaptive pruning-aware gradient method

Computational Optimization and Applications · 2025

被引 0

ABS 3

Margherita Porcelli
Giovanni Seraghiti 通讯
Philippe L. Toint

中文导读

提出一种自适应剪枝感知梯度方法，将变量分为两组并采用不同更新策略，实现模型参数拟合问题的后验稀疏化，理论证明收敛且实验表现优于现有剪枝感知Frank-Wolfe算法。

Abstract

Abstract A pruning-aware adaptive gradient method is proposed which classifies the variables in two sets before updating them using different strategies. This technique extends the “relevant/irrelevant" approach of Ding et al. (Adv Neural Inf Process Syst 32, 2019) and Zimmer et al. (Mathematical optimization for machine learning: proceedings of the MATH+ thematic Einstein semester 2023, 2025) and allows a posteriori sparsification of the solution of model parameter fitting problems. The new method is proved to be convergent with a global rate of decrease of the averaged gradient’s norm of the form $$\mathcal{O}(\log (k)/\sqrt{k+1})$$ . Numerical experiments on several applications show that it is competitive with existing pruning-aware Frank-Wolfe algorithms, see e.g. Zimmer et al. (Mathematical optimization for machine learning: proceedings of the MATH+ thematic Einstein semester 2023, 2025).

机器学习优化算法模型压缩梯度方法

阅读原文 ↗