Login / Signup
Imitation Learning via Off-Policy Distribution Matching.
Ilya Kostrikov
Ofir Nachum
Jonathan Tompson
Published in:
CoRR (2019)
Keyphrases
</>
imitation learning
reinforcement learning
humanoid robot
maximum margin
machine learning
probability distribution
random variables
learning algorithm
active learning