Sign in

Offline Imitation Learning with Suboptimal Demonstrations via Relaxed Distribution Matching.

Lantao YuTianhe YuJiaming SongWillie NeiswangerStefano Ermon
Published in: CoRR (2023)
Keyphrases
  • imitation learning
  • robotic systems
  • real time
  • reinforcement learning
  • maximum margin
  • feature selection
  • probability distribution
  • state space