Stable Policy Optimization via Off-Policy Divergence Regularization.
Ahmed Touati
Amy Zhang
Joelle Pineau
Pascal Vincent
Published in:
UAI (2020)
Keyphrases
optimization algorithm
global optimization
constrained optimization
optimization methods
optimization problems
stochastic gradient descent
risk minimization
dynamic programming
parameter selection
asymptotically optimal
discrete optimization
direct search
image restoration and reconstruction