Strategy Complexity of Parity Objectives in Countable MDPs.

Stefan Kiefer Richard Mayr Mahsa Shirmohammadi Patrick Totzke

Published in: CoRR (2020)

Keyphrases

average cost
reinforcement learning
markov decision processes
state space
search algorithm
markov chain
upper bound
worst case
multiple objectives
initial state
factored mdps