Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods.
Seohong Park
Jaekyeom Kim
Gunhee Kim
Published in:
NeurIPS (2021)
Keyphrases
policy gradient methods
natural actor critic
least squares
policy gradient
multi agent systems
dynamic programming
state space