Login / Signup
Strategy Complexity of Parity Objectives in Countable MDPs.
Stefan Kiefer
Richard Mayr
Mahsa Shirmohammadi
Patrick Totzke
Published in:
CoRR (2020)
Keyphrases
</>
average cost
reinforcement learning
markov decision processes
state space
search algorithm
markov chain
upper bound
worst case
multiple objectives
initial state
factored mdps