Keyphrases
- policy gradient
- parametric optimization
- actor-critic
- reinforcement learning
- function approximation
- gradient method
- optimal control
- reinforcement learning algorithms
- approximation methods
- partially observable Markov decision processes
- model-free reinforcement learning
- variance reduction
- average reward
- neural network
- feature maps
- radial basis function
- machine learning