Provably Convergent Policy Optimization via Metric-aware Trust Region Methods.
Jun Song
Niao He
Lijun Ding
Chaoyue Zhao
Published in:
Trans. Mach. Learn. Res. (2023)
Keyphrases
optimization methods
trust region
feature selection
image sequences
search space
simulated annealing
optimization method
metric learning
provably convergent