Sequential targeted optimality as a new criterion for teaching and following in repeated games.

Max Knobbout, Gerard Vreeswijk

Published in: AAMAS (2011)

Keyphrases

repeated games
incomplete information
average reward
stochastic games
learning environment
game theoretic
e-learning
learning process
markov decision processes
nash equilibrium
optimal solution
learning algorithm
reinforcement learning
computational complexity