Login / Signup
A convex programming-based algorithm for mean payoff stochastic games with perfect information.
Endre Boros
Khaled M. Elbassioni
Vladimir Gurvich
Kazuhisa Makino
Published in:
Optim. Lett. (2017)
Keyphrases
</>
learning algorithm
dynamic programming
worst case
objective function
linear programming
convex programming
multi agent
search space
np hard
upper bound
markov decision processes
convergence rate
primal dual
stochastic games