Econometrica: Mar, 2005, Volume 73, Issue 2
A Partial Folk Theorem for Games with Unknown Payoff Distributions
https://doi.org/10.1111/j.1468-0262.2005.00589.x
p. 629-645
Thomas Wiseman
Repeated games with unknown payoff distributions are analogous to a single decision maker's “multi‐armed bandit” problem. Each state of the world corresponds to a different payoff matrix of a stage game. When monitoring is perfect, information about the state is public, and players are sufficiently patient, the following result holds: For any function that maps each state to a payoff vector that is feasible and individually rational in that state, there is a sequential equilibrium in which players experiment to learn the realized state and achieve a payoff close to the one specified for that state.