Keyphrases
- policy gradient
- state action
- parametric optimization
- actor critic
- reinforcement learning
- optimal control
- approximation methods
- state transitions
- function approximation
- model free reinforcement learning
- neural network
- action space
- stochastic games
- reinforcement learning algorithms
- variance reduction
- evaluation function
- np hard