AC2: A Policy Gradient Actor with Primary and Secondary Critics.
Alfonso B. LabaoProspero C. NavalPublished in: IJCNN (2018)
Keyphrases
- policy gradient
- actor critic
- reinforcement learning
- parametric optimization
- function approximation
- optimal control
- gradient method
- approximation methods
- variance reduction
- model free reinforcement learning
- artificial neural networks
- average reward
- multi agent
- simulated annealing
- radial basis function
- evaluation function