Login / Signup
Linear Convergence of Natural Policy Gradient Methods with Log-Linear Policies.
Rui Yuan
Simon S. Du
Robert M. Gower
Alessandro Lazaric
Lin Xiao
Published in:
CoRR (2022)
Keyphrases
</>
policy gradient methods
log linear
natural actor critic
policy gradient
robot arm
log linear models
probabilistic modeling
efficient learning
convergence rate
reinforcement learning
discriminative training
reinforcement learning problems
latent variables