面向公平机器学习的战略最佳响应公平性框架

Strategic Best-Response Fairness Framework for Fair Machine Learning

Information Systems Research · 2025

被引 3

人大 AFT50UTD24ABS 4*

Hajime Shimao · 宾夕法尼亚州立大学
Warut Khern-am-nuai · 麦吉尔大学
Karthik Kannan · 亚利桑那大学
Maxime C. Cohen · 麦吉尔大学

中文导读

提出战略最佳响应公平性框架，评估机器学习算法在预测结果和行为响应两个层面是否真正消除歧视，发现常见公平算法未必有效，而均衡几率虽能实现但存在局限。

Abstract

This study introduces a framework called “strategic best-response fairness” (SBR-fairness) to address discrimination perpetuated by machine-learning (ML) algorithms. It challenges the conventional focus on fairness solely in prediction results, arguing that this approach ignores how individuals affected by the predictions may alter their behavior in response to algorithmic decisions. The framework considers whether an algorithm, trained on potentially biased data, leads to identical equilibrium behaviors across different subpopulations that are ex ante identical. The study finds that common fair-ML algorithms, such as those relying on color-blindness and demographic parity fairness criteria, do not always achieve SBR fairness. This means that they may not eliminate disparities in effort and outcomes. Equalized odds (EO), however, have been shown to achieve SBR fairness, but they suffer from several practical limitations. The study proposes that SBR fairness is a necessary condition for breaking cycles of discrimination in ML. It also argues that SBR fairness offers a complementary way to assess other fairness criteria and understand behavioral responses. The findings suggest a need for policy and practical focus on designing SBR-fair algorithms that promote equitable outcomes at both the prediction and behavioral level.

机器学习算法公平性行为经济学战略决策

阅读原文 ↗