Login / Signup

The Logarithmic Stochastic Tracing Procedure: A Homotopy Method to Compute Stationary Equilibria of Stochastic Games.

Steffen EibelshäuserVictor KlockmannDavid PoensgenAlicia von Schenk
Published in: INFORMS J. Comput. (2023)
Keyphrases
  • objective function
  • reinforcement learning
  • dynamic programming
  • linear programming
  • dynamic environments
  • nash equilibrium