Kernel-based reinforcement learning in average-cost problems.
Dirk Ormoneit
Peter W. Glynn
Published in:
IEEE Trans. Autom. Control. (2002)
Keyphrases
reinforcement learning
average cost
Markov decision processes
optimal policy
finite state
long run
Markov decision process
control policy
reinforcement learning methods
state space
finite number
infinite horizon
temporal difference