Uniform value for recursive games with compact action sets.
Xiaoxi Li
Sylvain Sorin
Published in:
Oper. Res. Lett. (2016)
Keyphrases
action sets
reinforcement learning
Markov decision processes
finite state
state space
data mining
learning algorithm
optimal policy
control system
Monte Carlo
model checking
production system
average cost