On the Linear Convergence of Policy Gradient under Hadamard Parameterization.

Jiacai Liu, Jinchi Chen, Ke Wei
Published in: CoRR (2023)
Keyphrases
  • policy gradient
  • convergence rate
  • parametric optimization
  • reinforcement learning
  • model checking
  • convergence speed
  • actor critic