Login / Signup
Robust Anytime Learning of Markov Decision Processes.
Marnix Suilen
Thiago D. Simão
David Parker
Nils Jansen
Published in:
NeurIPS (2022)
Keyphrases
</>
markov decision processes
reinforcement learning
learning algorithm
model based reinforcement learning
stochastic games
optimal policy
transition matrices
partially observable
dynamic programming
markov decision process
average reward
planning under uncertainty
risk sensitive
decision theoretic planning
state and action spaces
reachability analysis
supervised learning