On-Policy Robot Imitation Learning from a Converging Supervisor.
Ashwin Balakrishna
Brijen Thananjeyan
Jonathan Lee
Arsh Zahed
Felix Li
Joseph E. Gonzalez
Ken Goldberg
Published in:
CoRR (2019)
Keyphrases
imitation learning
humanoid robot
robotic systems
maximum margin
reinforcement learning
robot behavior
optimal policy
mobile robot
multi modal
human teacher
feature space
dynamic environments
pattern classification
action selection
markov decision process