Hindsight Trust Region Policy Optimization.
Hanbo Zhang, Site Bai, Xuguang Lan, David Hsu, Nanning Zheng
Published in: IJCAI (2021)
Keyphrases
- trust region
- optimization methods
- unconstrained optimization
- line search
- global optimum
- optimization algorithm
- optimization method
- column generation
- optimization problems
- constrained optimization
- least squares
- multi-objective
- quadratic programming
- simulated annealing
- conjugate gradient
- training algorithm
- global convergence
- Hessian matrix
- integer programming
- Levenberg-Marquardt
- feature selection
- back propagation
- particle swarm optimization
- evolutionary algorithm
- optimal solution