Keyphrases
- policy gradient
- optimal control
- actor-critic
- reinforcement learning
- parametric optimization
- state space
- dynamic programming
- Markov chain
- model-free reinforcement learning
- control problems
- gradient method
- reinforcement learning algorithms
- control strategy
- dynamical systems
- function approximation
- partially observable Markov decision processes
- infinite horizon
- average reward
- control system