Policy Gradient for Reinforcement Learning with General Utilities
Navdeep Kumar
Kaixin Wang
Utkarsh Pratiush
Kfir Yehuda Levy
Shie Mannor
Published in:
Tiny Papers @ ICLR (2024)
Keyphrases
policy gradient
reinforcement learning
actor critic
utility function
function approximation
policy gradient methods
function approximators
approximation methods
neural network
learning algorithm
reinforcement learning algorithms
machine learning
temporal difference
variance reduction