Causal Policy Gradient for Whole-Body Mobile Manipulation.
Jiaheng HuPeter StoneRoberto Martín-MartínPublished in: CoRR (2023)
Keyphrases
- policy gradient
- parametric optimization
- actor critic
- reinforcement learning
- function approximation
- optimal control
- reinforcement learning algorithms
- gradient method
- model free reinforcement learning
- bayesian networks
- approximation methods
- variance reduction
- computational complexity
- multi agent
- average reward
- single agent
- evaluation function
- neural network