Login / Signup
Hierarchical Actor-Critic.
Andrew Levy
Robert Platt Jr.
Kate Saenko
Published in:
CoRR (2017)
Keyphrases
</>
actor critic
reinforcement learning
optimal control
policy gradient
approximate dynamic programming
temporal difference
neuro fuzzy
function approximation
policy iteration
gradient method