Stein Variational Policy Gradient.
Yang LiuPrajit RamachandranQiang LiuJian PengPublished in: UAI (2017)
Keyphrases
- policy gradient
- parametric optimization
- actor critic
- reinforcement learning
- function approximation
- gradient method
- image segmentation
- optimal control
- model free reinforcement learning
- partially observable markov decision processes
- variance reduction
- reinforcement learning algorithms
- average reward
- approximation methods
- neural network
- state action
- reinforcement learning methods