自主算法合谋：序贯定价下的Q学习

Autonomous algorithmic collusion: Q‐learning under sequential pricing

RAND Journal of Economics · 2021

被引 211 · 同刊同年前 3%

人大 AFT50ABS 4

Timo Klein · 乌得勒支大学经济学院通讯

中文导读

研究在序贯竞争模拟中，强化学习算法如何在有限离散价格集下学会合谋，以及价格集扩大时如何收敛到超竞争的不对称周期，并讨论政策含义。

Abstract

Abstract Prices are increasingly set by algorithms. One concern is that intelligent algorithms may learn to collude on higher prices even in the absence of the kind of coordination necessary to establish an antitrust infringement. However, exactly how this may happen is an open question. I show how in simulated sequential competition, competing reinforcement learning algorithms can indeed learn to converge to collusive equilibria when the set of discrete prices is limited. When this set increases, the algorithm considered increasingly converges to supra‐competitive asymmetric cycles. I show that results are robust to various extensions and discuss practical limitations and policy implications.

强化学习算法合谋定价序贯定价算法共谋

阅读原文 ↗