Privacy-Preserving Realization of Fuzzy Clustering and Fuzzy Modeling Through Vertical Federated Learning
提出一种在纵向联邦学习环境下,基于垂直划分数据集、不共享原始数据而仅交换分区矩阵或梯度,实现隐私保护的模糊聚类与模糊规则建模方法,实验表明其性能接近集中式方法。
In this study, we elaborate on a realization of fuzzy clustering and the construction of fuzzy rule-based models on the basis of vertically partitioned datasets in a privacy-preserving federated learning approach. The main focus of the overall design process is to construct a family of information granules (clusters) and the corresponding fuzzy rules in the presence of a collection of vertically partitioned datasets without compromising data privacy. These datasets are composed of the same data but are described by different features, and due to security considerations, data cannot be shared. The vertical federated fuzzy clustering can be realized as an iterative optimization process composed of successive cycles: 1) computation (update) of the prototypes and partition matrices performed on the basis of local datasets and 2) an integration of the local sources of knowledge carried out on a central coordinator-server. The update of the partition matrices can be completed using a distance-based or gradient-based approach. The communication of findings between local clients and the coordinator-server is realized through exchanging partition matrices, which are more general than numeric data and can avoid leakage of data privacy. Fuzzy models are optimized in a similar manner through exchanging the gradients of the performance index computed with respect to the parameters between the clients and the global coordinator. The proposed mechanism exhibits significant originality since the realization of fuzzy modeling in a vertical federated learning environment has not been studied. Experimental studies show that the proposed federated clustering and fuzzy model design could effectively reveal the structure of the entire dataset and achieve high performance compared with the results obtained in a centralized manner.