Login / Signup
Learning in two-player zero-sum partially observable Markov games with perfect recall.
Tadashi Kozuno
Pierre Ménard
Rémi Munos
Michal Valko
Published in:
NeurIPS (2021)
Keyphrases
</>
partially observable
learning algorithm
reinforcement learning
markov decision processes
supervised learning
state space
markov decision problems
markov games
learning tasks
action selection
action models
partial observations