Login / Signup
Learning Nash Equilibrium for General-Sum Markov Games from Batch Data.
Julien Pérolat
Florian Strub
Bilal Piot
Olivier Pietquin
Published in:
CoRR (2016)
Keyphrases
</>
nash equilibrium
reinforcement learning
special case
learning process
dynamic programming
probability distribution
game theoretic