Login / Signup
An adaptive policy gradient in learning Nash equilibria.
Huaxiang Zhang
Ying Fan
Published in:
Neurocomputing (2008)
Keyphrases
</>
multiagent learning
policy gradient
nash equilibria
learning process
stochastic games
learning algorithm
reinforcement learning
machine learning
multi agent
upper bound
learning tasks
game theoretic
actor critic
model free reinforcement learning