PPG Reloaded: An Empirical Study on What Matters in Phasic Policy Gradient.
Kaixin Wang, Daquan Zhou, Jiashi Feng, Shie Mannor
Published in: ICML (2023)
Keyphrases
- policy gradient
- reinforcement learning
- actor critic
- function approximation
- independent component analysis
- gradient method
- model free reinforcement learning
- optimal control
- parametric optimization
- reinforcement learning algorithms
- approximation methods
- variance reduction
- reinforcement learning methods
- single agent
- state space
- dynamic programming
- state action
- markov decision processes