GRAC: Self-Guided and Self-Regularized Actor-Critic.
Lin ShaoYifan YouMengyuan YanShenli YuanQingyun SunJeannette BohgPublished in: CoRL (2021)
Keyphrases
- actor critic
- reinforcement learning
- policy gradient
- optimal control
- temporal difference
- gradient method
- approximate dynamic programming
- neuro fuzzy
- reinforcement learning algorithms
- function approximation
- policy iteration
- average reward
- least squares
- linear program
- markov decision processes
- dynamic programming
- fixed point
- optimization methods
- learning algorithm
- model free
- state space
- objective function
- decision making