Independent Natural Policy Gradient Always Converges in Markov Potential Games.
Roy Fox
Stephen McAleer
Will Overman
Ioannis Panageas
Published in:
CoRR (2021)
Keyphrases
policy gradient
Markov chain
optimal solution
Nash equilibrium
game theoretic
learning algorithm
cost function
NP-hard
probability distribution
function approximation
actor critic