Sepuences of Games with Varying Opponents
研究玩家在序列博弈中面对不同对手、不同博弈和不完全信息时的行为,假设玩家基于对对手行为的平稳马尔可夫假设做出最优反应,并探讨假设在平均意义上正确的条件。
This paper considers a problem faced by players who are involved in a sequence of games: not necessarily the same games, not necessarily with the same opponents, and not necessarily under conditions of complete information. The players are assumed to act in response to stationary Markovian hypotheses which they form about the actions of their opponents. Conditions are explored which require that these hypotheses be correct on average and that the players actions be optimal in response to their hypotheses.