异构环境下通过公平梯度裁剪实现阶最优的拜占庭鲁棒学习

Order-Optimal Byzantine-Robust Learning Under Heterogeneity via Fair Gradient Clipping

IEEE Transactions on Cybernetics · 2025

被引 1

ABS 3

Zhi-Yong Wang
Hao Nan Sheng
Qiushi Yang
Hing Cheung So

中文导读

提出一种新的聚合规则，根据梯度与聚合中心的距离进行裁剪，在数据异构下实现0.5的崩溃点和阶最优的拜占庭鲁棒训练误差，优于中位数方案。

Abstract

Byzantine-robust distributed or federated learning (FL) refers to providing reliable performance under Byzantine attacks, which violate the prescribed protocols and transmit arbitrary information to the server to hamper the convergence of machine learning (ML) algorithms, via designing resilient aggregation rules to combat attacks. Although numerous robust rules have been suggested, their performance degrades for heterogeneous data. A few techniques have been exploited to handle this problem, but they either require preaggregation operations, hence increasing the computational load, or lack breakdown point analysis of their rules. This article proposes a new aggregation rule, which clips the gradients received from all workers according to the distance between the gradient and the aggregation center. That is, when the distance is larger than the radius $\gamma $ , the gradient will be clipped, and the longer the distance, the closer the clipped gradient is to the center. We theoretically analyze that the breakdown point of the developed rule is 0.5, the maximum value for robust aggregators. Moreover, our rule achieves order-optimal Byzantine-robust training error under data heterogeneity, while the median-based schemes, such as coordinate-wise median (CM) and geometric median (GM), are suboptimal. Experimental results demonstrate that the devised aggregation mechanism can handle different attacks well and outperforms the existing rules.

分布式学习联邦学习鲁棒聚合拜占庭攻击梯度裁剪

阅读原文 ↗