Login / Signup
Kernel Rewards Regression: An Information Efficient Batch Policy Iteration Approach.
Daniel Schneegaß
Steffen Udluft
Thomas Martinetz
Published in:
Artificial Intelligence and Applications (2006)
Keyphrases
</>
markov decision processes
neural network
reinforcement learning
optimal policy