Iteratively Extending Time Horizon Reinforcement Learning.
Damien ErnstPierre GeurtsLouis WehenkelPublished in: ECML (2003)
Keyphrases
- reinforcement learning
- function approximation
- reinforcement learning algorithms
- state space
- multi agent
- model free
- learning algorithm
- markov decision processes
- robotic control
- policy search
- transfer learning
- partially observable
- control problems
- markov decision process
- database
- optimal policy
- machine learning
- optimal control
- data mining
- temporal difference
- real world
- neural network
- relational reinforcement learning