Login / Signup
Offline Imitation Learning with Suboptimal Demonstrations via Relaxed Distribution Matching.
Lantao Yu
Tianhe Yu
Jiaming Song
Willie Neiswanger
Stefano Ermon
Published in:
AAAI (2023)
Keyphrases
</>
imitation learning
reinforcement learning
humanoid robot
robotic systems
maximum margin
real time
probability distribution
support vector machine