Login / Signup
Explicit Kernel Rewards Regression for data-efficient near-optimal policy identification.
Daniel Schneegaß
Steffen Udluft
Thomas Martinetz
Published in:
ESANN (2007)
Keyphrases
</>
optimal policy
markov decision processes
decision problems
reinforcement learning
multistage
state space