Login / Signup
Combining Model-Based Design and Model-Free Policy Optimization to Learn Safe, Stabilizing Controllers.
Tyler Westenbroek
Ayush Agrawal
Fernando Castañeda
S. Shankar Sastry
Koushil Sreenath
Published in:
ADHS (2021)
Keyphrases
</>
model free
reinforcement learning
policy iteration
reinforcement learning algorithms
function approximation
average reward
impedance control
optimal policy
temporal difference
policy evaluation
neural network
genetic algorithm