Login / Signup
Robust Anytime Learning of Markov Decision Processes.
Marnix Suilen
Thiago D. Simão
Nils Jansen
David Parker
Published in:
CoRR (2022)
Keyphrases
</>
markov decision processes
reinforcement learning
state space
learning algorithm
average reward
stochastic games
decision theoretic planning
model based reinforcement learning
finite state
partially observable
function approximation
policy iteration
planning under uncertainty
risk sensitive
macro actions