Login / Signup
On-Policy Robot Imitation Learning from a Converging Supervisor.
Ashwin Balakrishna
Brijen Thananjeyan
Jonathan Lee
Felix Li
Arsh Zahed
Joseph E. Gonzalez
Ken Goldberg
Published in:
CoRL (2019)
Keyphrases
</>
imitation learning
humanoid robot
robotic systems
maximum margin
reinforcement learning
robot behavior
optimal policy
human teacher
multi modal
autonomous robots
function approximation
reinforcement learning methods