对抗机器:人类如何与学习算法博弈

Rage against the machines: how subjects play against learning algorithms

Economic Theory · 2009
被引 45
人大 A-ABS 3

中文导读

通过大规模互联网实验,研究人类如何与不同学习算法(如最佳反应、虚拟博弈、模仿、强化学习等)对弈,发现人类会进行战略性教学来利用算法,但模仿算法难以被利用。

Abstract

We use a large-scale internet experiment to explore how subjects learn to play against computers that are programmed to follow one of a number of standard learning algorithms. The learning theories are (unbeknown to subjects) a best response process, fictitious play, imitation, reinforcement learning, and a trial & error process. We explore how subjects’ performances depend on their opponents’ learning algorithm. Furthermore, we test whether subjects try to influence those algorithms to their advantage in a forward-looking way (strategic teaching). We find that strategic teaching occurs frequently and that all learning algorithms are subject to exploitation with the notable exception of imitation.

人机博弈学习算法策略性教学算法利用