Login / Signup

Approximation Benefits of Policy Gradient Methods with Aggregated States.

Daniel Russo
Published in: Manag. Sci. (2023)
Keyphrases
  • long run
  • queueing networks
  • policy gradient methods
  • natural actor critic
  • state transitions
  • robot arm
  • policy gradient
  • neural network