Keyphrases
- policy gradient
- state action
- actor critic
- parametric optimization
- reinforcement learning
- policy search
- gradient method
- function approximation
- model free reinforcement learning
- reinforcement learning algorithms
- action selection
- average reward
- optimal control
- approximation methods
- evaluation function
- markov chain
- control system