Login / Signup

Provably Convergent Policy Optimization via Metric-aware Trust Region Methods.

Jun SongNiao HeLijun DingChaoyue Zhao
Published in: CoRR (2023)
Keyphrases
  • optimization methods
  • trust region
  • optimization algorithm
  • provably convergent
  • optimization method
  • learning algorithm
  • single image
  • linear program
  • density estimation