Login / Signup
Off-Policy Reward Shaping with Ensembles.
Anna Harutyunyan
Tim Brys
Peter Vrancx
Ann Nowé
Published in:
CoRR (2015)
Keyphrases
</>
reward shaping
reinforcement learning
complex domains
markov decision problems
reinforcement learning algorithms
state space
machine learning
learning algorithm
decision trees
markov decision processes
inductive learning
temporal difference