On the Convergence Rate of Off-Policy Policy Optimization Methods with Density-Ratio Correction.
Jiawei Huang, Nan Jiang
Published in: CoRR (2021)
Keyphrases
- optimization methods
- convergence rate
- density ratio
- global convergence
- gradient method
- step size
- learning rate
- gravitational search algorithm
- convergence speed
- least squares
- optimization problems
- optimization method
- simulated annealing
- density ratio estimation
- primal dual
- optimal policy
- stochastic methods
- semi parametric
- line search
- particle swarm
- numerical stability
- evolutionary algorithm