🌙

基于Shapley值的特征归因用于数据掩码

Shapley Value-Based Feature Attribution for Data Masking

MIS Quarterly · 2025
被引 0
人大 A+FT50UTD24ABS 4*

中文导读

提出一个基于Shapley值的特征归因框架,在特征层面平衡数据隐私中的披露风险与数据效用,适用于多种掩码方法和评估指标。

Abstract

Despite its many benefits, widespread access to individuals’ personal data also causes severe privacy concerns for consumers, companies, and policymakers. This study proposes a novel framework that adapts the Shapley value-based feature attribution approach to the problem domain of data privacy by capturing the two crucial dimensions of data privacy—disclosure risk and data utility. Our proposed framework takes a holistic view of data masking through a fair feature attribution approach based on Shapley values. Different from the existing literature that mostly focuses on the risk-utility trade-off at the dataset level, the proposed framework addresses the trade-off at the feature level. Furthermore, the proposed framework is agnostic to data masking methods, statistical and machine learning methods, and data utility and disclosure risk evaluation metrics. Experimental results show that our proposed method can effectively reduce disclosure risk while preserving data utility.

数据隐私特征归因数据掩码Shapley值