Login / Signup
Value iteration for simple stochastic games: Stopping criterion and learning algorithm.
Julia Eisentraut
Edon Kelmendi
Jan Kretínský
Maximilian Weininger
Published in:
Inf. Comput. (2022)
Keyphrases
</>
stochastic games
learning algorithm
markov decision processes
stopping criterion
average reward
reinforcement learning algorithms
reinforcement learning
state space
nash equilibria
multi agent
optimal policy
learning agent
learning process
dynamic programming
least squares