Approximation Benefits of Policy Gradient Methods with Aggregated States.

Published in: Manag. Sci. (2023)

Keyphrases

long run
queueing networks
policy gradient methods
natural actor critic
state transitions
robot arm
policy gradient
neural network