Temporal Difference Models: Model-Free Deep RL for Model-Based Control.
Vitchyr PongShixiang GuMurtaza DalalSergey LevinePublished in: CoRR (2018)
Keyphrases
- model free
- temporal difference
- reinforcement learning
- function approximation
- reinforcement learning algorithms
- td learning
- policy evaluation
- policy iteration
- temporal difference learning
- temporal difference methods
- rl algorithms
- machine learning algorithms
- control strategies
- average reward
- actor critic
- evaluation function
- markov decision processes
- monte carlo
- machine learning
- reinforcement learning methods