Model-free Trajectory Optimization for Reinforcement Learning.
Riad Akrour, Abbas Abdolmaleki, Hany Abdulsamad, Gerhard Neumann
Published in: CoRR (2016)
Keyphrases
- model free
- reinforcement learning
- reinforcement learning algorithms
- function approximation
- temporal difference
- policy iteration
- Markov decision processes
- learning algorithm
- reinforcement learning methods
- action selection
- state space
- machine learning
- RL algorithms
- optimal policy
- function approximators
- average reward
- policy evaluation
- impedance control