Iteratively Extending Time Horizon Reinforcement Learning.

Damien Ernst, Pierre Geurts, Louis Wehenkel

Published in: ECML (2003)

Keyphrases

reinforcement learning
function approximation
reinforcement learning algorithms
state space
multi-agent
model-free
learning algorithm
Markov decision processes
robotic control
policy search
transfer learning
partially observable
control problems
Markov decision process
database
optimal policy
machine learning
optimal control
data mining
temporal difference
real world
neural network
relational reinforcement learning