Self-organized Reinforcement Learning Based on Policy Gradient in Nonstationary Environments.
Yu Hiei, Takeshi Mori, Shin Ishii
Published in: ICANN (1) (2008)
Keyphrases
- non stationary
- policy gradient
- reinforcement learning
- actor critic
- function approximation
- reinforcement learning algorithms
- policy search
- policy gradient methods
- optimal control
- model free reinforcement learning
- random fields
- function approximators
- dynamic programming
- gradient method
- variance reduction
- reinforcement learning methods
- state space
- temporal difference
- model free
- approximation methods
- state action
- approximate dynamic programming
- multi agent
- learning algorithm