Login / Signup
Learning Optimal Controllers by Policy Gradient: Global Optimality via Convex Parameterization.
Yue Sun
Maryam Fazel
Published in:
CDC (2021)
Keyphrases
</>
globally optimal
global optimality
reinforcement learning
policy gradient
dynamic programming
learning algorithm
learning tasks
piecewise linear
optimal solution
supervised learning
global optimization
image segmentation
sufficient conditions