Reinforcement Learning with Deep Deterministic Policy Gradient.
Haining Tan
Published in: CAIBDA (2021)
Keyphrases
- policy gradient
- reinforcement learning
- actor critic
- policy search
- function approximation
- reinforcement learning algorithms
- optimal control
- policy gradient methods
- model free reinforcement learning
- gradient method
- reinforcement learning methods
- approximation methods
- state space
- partially observable Markov decision processes
- average reward
- state action
- function approximators
- approximate dynamic programming
- learning algorithm
- model free
- dynamic programming
- transfer learning
- optimal policy
- temporal difference learning
- control problems
- Markov decision processes