Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences.
Alan Chan
Hugo Silva
Sungsu Lim
Tadashi Kozuno
A. Rupam Mahmood
Martha White
Published in:
CoRR (2021)
Keyphrases
Kullback-Leibler
optimization problems
optimal policy
global optimization
evolution strategy
bidirectional
optimization process
constrained optimization
KL divergence
Bayesian networks
evolutionary algorithm
state space