Combining Model-Based and Model-Free Methods for Nonlinear Control: A Provably Convergent Policy Gradient Approach.
Guannan Qu
Chenkai Yu
Steven H. Low
Adam Wierman
Published in:
CoRR (2020)
Keyphrases
model free
reinforcement learning algorithms
reinforcement learning
control problems
function approximation
average reward
learning algorithm
control system
supervised learning
shape from shading
optimization methods
policy iteration
gradient method
RL algorithms
policy gradient