Login / Signup
Imitation Learning via Off-Policy Distribution Matching.
Ilya Kostrikov
Ofir Nachum
Jonathan Tompson
Published in:
ICLR (2020)
Keyphrases
</>
imitation learning
robotic systems
maximum margin
reinforcement learning
probability distribution
humanoid robot
real time
learning algorithm
training set
relational databases