Login / Signup
Provably Convergent Policy Optimization via Metric-aware Trust Region Methods.
Jun Song
Niao He
Lijun Ding
Chaoyue Zhao
Published in:
CoRR (2023)
Keyphrases
</>
optimization methods
trust region
optimization algorithm
provably convergent
optimization method
learning algorithm
single image
linear program
density estimation