Login / Signup
Approximation Benefits of Policy Gradient Methods with Aggregated States.
Daniel Russo
Published in:
Manag. Sci. (2023)
Keyphrases
</>
long run
queueing networks
policy gradient methods
natural actor critic
state transitions
robot arm
policy gradient
neural network