Login / Signup
Value Iteration for Simple Stochastic Games: Stopping Criterion and Learning Algorithm.
Edon Kelmendi
Julia Krämer
Jan Kretínský
Maximilian Weininger
Published in:
CAV (1) (2018)
Keyphrases
</>
stochastic games
markov decision processes
learning algorithm
stopping criterion
reinforcement learning algorithms
state space
average reward
reinforcement learning
supervised learning
optimal policy
learning agent
resource allocation
convergence rate
nash equilibrium
learning automata