Login / Signup
Sequential targeted optimality as a new criterion for teaching and following in repeated games.
Max Knobbout
Gerard Vreeswijk
Published in:
AAMAS (2011)
Keyphrases
</>
repeated games
incomplete information
average reward
stochastic games
learning environment
game theoretic
e learning
learning process
markov decision processes
nash equilibrium
optimal solution
learning algorithm
reinforcement learning
computational complexity