SoftDICE for Imitation Learning: Rethinking Off-policy Distribution Matching.
Mingfei Sun
Anuj Mahajan
Katja Hofmann
Shimon Whiteson
Published in: CoRR (2021)
Keyphrases
imitation learning
robotic systems
reinforcement learning
maximum margin
probability distribution
learning algorithm
state space
support vector machine
social network analysis
relational domains