Rage against the machines: how subjects play against learning algorithms
通过大规模互联网实验,研究人类如何与不同学习算法(如最佳反应、虚拟博弈、模仿、强化学习等)对弈,发现人类会进行战略性教学来利用算法,但模仿算法难以被利用。
We use a large-scale internet experiment to explore how subjects learn to play against computers that are programmed to follow one of a number of standard learning algorithms. The learning theories are (unbeknown to subjects) a best response process, fictitious play, imitation, reinforcement learning, and a trial & error process. We explore how subjects’ performances depend on their opponents’ learning algorithm. Furthermore, we test whether subjects try to influence those algorithms to their advantage in a forward-looking way (strategic teaching). We find that strategic teaching occurs frequently and that all learning algorithms are subject to exploitation with the notable exception of imitation.