Trust Region-Guided Proximal Policy Optimization.
Yuhui WangHao HeXiaoyang TanYaozhong GanPublished in: NeurIPS (2019)
Keyphrases
- trust region
- optimization methods
- unconstrained optimization
- line search
- global optimum
- optimization algorithm
- column generation
- optimization method
- global convergence
- optimization problems
- constrained optimization
- feature selection
- levenberg marquardt
- risk minimization
- constraint programming
- step size
- simulated annealing
- cost function