Model-based Lookahead Reinforcement Learning.
Zhang-Wei Hong, Joni Pajarinen, Jan Peters
Published in: CoRR (2019)
Keyphrases
- reinforcement learning
- model free
- function approximation
- temporal difference
- reinforcement learning algorithms
- real time
- learning algorithm
- direct policy search
- reinforcement learning methods
- state space
- artificial intelligence
- machine learning
- optimal control
- Monte Carlo
- artificial neural networks
- learning capabilities
- temporal difference learning