对抗机器：人类如何与学习算法博弈

Rage against the machines: how subjects play against learning algorithms

Economic Theory · 2009

被引 45

人大 A-ABS 3

Peter Duersch
Albert Kolb
Jörg Oechssler 通讯
Burkhard C. Schipper

中文导读

通过大规模互联网实验，研究人类如何与不同学习算法（如最佳反应、虚拟博弈、模仿、强化学习等）对弈，发现人类会进行战略性教学来利用算法，但模仿算法难以被利用。

Abstract

We use a large-scale internet experiment to explore how subjects learn to play against computers that are programmed to follow one of a number of standard learning algorithms. The learning theories are (unbeknown to subjects) a best response process, fictitious play, imitation, reinforcement learning, and a trial & error process. We explore how subjects’ performances depend on their opponents’ learning algorithm. Furthermore, we test whether subjects try to influence those algorithms to their advantage in a forward-looking way (strategic teaching). We find that strategic teaching occurs frequently and that all learning algorithms are subject to exploitation with the notable exception of imitation.

人机博弈学习算法策略性教学算法利用

阅读原文 ↗